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Module-1 
INTRODUCTION 


1.1 Data Communication 


Data communications are the exchange of data between two devices via some form of 
transmission medium such as a wire cable. 

The word dara refers to information presented in whatever form is agreed upon by the 
parties creating and using the data. 

For data communications to occur, the communicating devices must be part of a 
communication system made up of a combination of hardware (physical equipment) and 


software (programs). 


The effectiveness of a data “communications, system depends on four fundamental 


characteristics: 


1. Delivery: The system must deliver data)to the correct destination. Data must be received 
by the intended device oruser and only. by that device or user. 

2. Accuracy: The system must)deliver the data accurately. Data that have been altered in 
transmission and left uncorrected)are unusable. 

3. Timeliness: The system must deliver data in a timely manner. Data delivered late are 
useless. In the case,of video and audio, timely delivery means delivering data as they are 
produced, in the)same order that they are produced, and without significant delay. This 
kind of delivery is called real-time transmission. 

4. Jitter: Jitter refers to the variation in the packet arrival time. It is the uneven delay in the 
delivery of audio or video packets. 

For example, let us assume that video packets are sent every 3D ms. If some of the 
packets arrive with 3D-ms delay and others with 4D-ms delay, an uneven quality in the 


video is the result. 
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Components of Data Communication 


A data communications system has five components 


ie Protocol Protocol 
S_F _—_S_ 








Medium 


Message: The message is the information (data) to be communicated. 

Popular forms of information include text, numbers, pictures, audio, and video. 

Sender: The sender is the device that sends the data message. 

It can be a computer, workstation, telephone handset, video camera,and so on. 

Receiver: The receiver is the device thatreceives the message. 

It can be a computer, workstation, telephone handset)television, and so on. 

Transmission medium: The ‘transmission medium is) the physical path by which a 
message travels from sender toyreceiver. 

Examples: twisted-pair wire)coaxial cable, fiber-optic cable, and radio waves. 

Protocol: A protocol is a)set of rules that govern data communications. It represents an 


agreement between the communicating devices. 


Data Representation 


Information ean be inidifferentforms like text, numbers, images, audio, and video. 


1) Text 





In data communications, text is represented as a bit pattern, which is as sequence of bits 
made up of 0’s and 1’s. 
Different sets of bit patterns have been designed to represent text symbols. Each set is 


called a code, and the process of representing symbols is called coding. 
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¢ Most Common coding system is Unicode which uses 32 bits to represent a symbol or 
character. The American Standard Code for Information Interchange (ASCII) 


constitutes the first 127 characters in Unicode. 


2) Numbers 
Numbers are also represented by bit patterns. However, a code such as ASCII is not used 
to represent numbers; the number is directly converted to a binary number to simplify 


mathematical operations. 


3) Images 

e Images are also represented by bit patterns. Image is composed of a matrix of pixels 
(picture elements), where each pixel is a small dot.The size of the pixel depends on the 
resolution. 

e After an image is divided into pixels, each pixehis assigned abit pattern. The size and the 
value of the pattern depend on the image. 

e For an image made of only black and white dots, a 1-bit pattern is enough to represent a 
pixel. 

e If an image isnot made of pure white and pure black pixels, size of the bit pattern can be 
increased to include gray,scale) For example, to show four levels of gray scale 2 bits 
pattern)can be.used. A black pixel can be represented by 00, a dark gray pixel by 01, a 
light gray pixel by 10, anda white\pixel by 11. 

¢ Color Images can be represented in several ways. One method is called RGB, where each 
color is made.of a combination of three primary colors: red, green, and blue. Another 
method is;called YEM, in which a color is made of a combination of three other primary 


colors: yellow, cyan, and magenta. 


4) Audio 
Audio refers to the recording or broadcasting of sound or music. Audio is by nature 


different from text, numbers, or images. It is continuous, not discrete. 
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5) Video 
Video refers to the recording or broadcasting of a picture or movie. Video can either be 
produced as a continuous entity, or it can be a combination of images, each a discrete 


entity, arranged to convey the idea of motion. 


Data Flow 


Communication between two devices can be simplex, half-duplex, or full-duplex. 








Direction of data 
|) 


Mainframe Monitor 

















a. Simplex 





Direction of data at time 1 
pati detail es 


Station = Station 
Direction of data at time 2 








fo 


Half-duplex 


Direction of data all the time 
<__Diction of data atthe time 5 


©. Full-duplex 




















Simplex: 
¢ simplex mode, the communication is unidirectional. Only one of the two devices on a 
link can.transmit; the other camonly receive. 


e Example: Keyboardsjand traditional monitors. 


Half-Duplex: 
e In half-duplex mode, each station can both transmit and receive, but not at the same time. 
When one device is sending, the other can only receive, and vice versa. 
e Example: Walkie-talkies. 
e The half-duplex mode is used in cases where there is no need for communication in both 
directions at the same time; the entire capacity of the channel can be utilized for each 


direction. 
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Full-Duplex: 
e In full-duplex mode (also called duplex), both stations can transmit and receive 
simultaneously. 
e Example: telephone network 
e The full-duplex mode is used when communication in both directions is required all the 


time. The capacity of the channel, however, must be divided between the two directions. 


1.2 Networks 


e A network is a set of devices (nodes) connected by communication links. 

e A node can be a computer, printer, or any other device capable. of sending and/or receiving 
data generated by other nodes on the network. 

© Most networks uses distributed processingyain which a task is divided among multiple 
computers. Instead of one single large machine being responsible,for all aspects of process, 


separate computers handle,a subset. 


Network Criteria 
The networks should meet some criteria, they, are: 
1) Performance 
e Performance can be measured using transit time and response time. 
* (Transit time is the amount of time required for a message to travel from one device 
to another. 
= Response time is the elapsed time between an inquiry and a response. 
e The performance of a network depends on a number of factors: 
= The number of users 
= The type of transmission medium 
= The capabilities of the connected hardware 
= The efficiency of the software. 
e Performance can be evaluated using throughput and delay. Throughput should be more 


and delay should be less. 
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2) Reliability 
Network reliability is measured by the frequency of failure, the time it takes a link to 
recover from a failure, and the network's robustness in a catastrophe. 

3) Security 
Network security issues include protecting data from unauthorized access, protecting data 
from damage and development, and implementing policies and procedures for recovery 


from breaches and data losses. 


Physical Structures 


Type of Connection: 

A network is two or more devices connected through links. A link is ajcommunications pathway 
that transfers data from one device to another. 

For communication to occur, two devices must be connected in som@way to the same link at the 
same time. 


There are two possible types, of connections: point-to-point and multipoint. 





a. Point-to-point 











— 
















Mainframe 














b. Multipoint 
1) Point-to-Point 
e A point-to-point connection provides a dedicated link between two devices. 


e The entire capacity of the link is reserved for transmission between those two devices. 
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2) Multipoint 
e A multipoint (also called multidrop) connection is one in which more than two specific 
devices share a single link. 
e Ina multipoint environment, the capacity of the channel is shared, either spatially or 
temporally. If several devices can use the link simultaneously, it is a spatially shared 


connection. If users must take turns, it is a fimeshared connection. 


Physical Topology 
The term physical topology refers to the way in which a network is laidjout physically. 
The topology of a network is the geometric representation of the relationshipof all the links and 


linking devices (usually called nodes) to one another. There are four basic topologies possible: 


Topology 


mesh, star, bus, and ring. 


1) Mesh Topology 


e In a mesh topology, every device has a dedicated point-to-point link to every other 


device. 








Consider a fully connected mesh network with n nodes in which each node must be connected to 
every other node. Node | must be connected to n - 1 nodes, node 2 must be connected to n — 1 
nodes, and finally node n must be connected to n - 1 nodes. Hence n(n - 1) physical links are 
required. If each physical link allows communication in both directions (duplex mode), we need 


n(n -1) /2 duplex-mode links. 





e To accommodate many links, every device on the network must have n — | input/output 
(I/O) ports to be connected to the other 7 - | stations. 
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Station 








Advantages 


The use of dedicated links guarantees that each connection can ¢arry its own data load, 
thus eliminating the traffic problems that ¢an occur when links aréyshared between 
multiple devices. 

A mesh topology is robust. If one link becomes unusable, it does not affect the entire 
system. 

There is the advantage of privacyor security. When every)message travels along a 
dedicated line, only the intended recipient sees,it. Physical boundaries prevent other users 
from gaining access to messages. 


Point-to-point links maké fault identification and fault isolation easy. 


Disadvantages 


The amount of cablingyand the.number of\I/O ports required is more because every 
device must be connected)to every other device, installation and reconnection are 
difficult. 

The sheer bulk of the wiring can be greater than the available space (in walls, ceilings, or 
floors) can accommodate. 


The hardware required to connect each link (I/O ports and cable) is expensive. 


2) Star Topology 
In a star topology, each device has a dedicated point-to-point link only to a central controller, 


usually called a hub. 
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The devices are not directly linked to one another. Star topology does not allow direct traffic 
between devices. The controller acts as an exchange: If one device wants to send data to another, 


it sends the data to the controller, which then relays the data to the other connected device. 
Hub 

















[Station p [station p [station p | Station p 


Advantages: 





e A star topology is less expensive than a mesh:topology. 

e Inastar, each device needs only one link and oneJ/O port to connect it to any number of 
others. This factor also makes it easy to install and reconfigure. 

e The robustness. If one link fails, onlythatlink is affected. All other links remain active. 

e Fault isolation and identificationjis easy. 


e As long as the hub is working, it can be used to. monitor link problems and bypass 
defective links. 


Disadvantages: 
¢ One big disadvantage ofia star topology is the dependency of the whole topology on one 


single\point, the hub, If the hub goes, down, the whole system is dead. 


3) Bus Topology 
A bus topology, on.the other hand, is multipoint. One long cable acts as a backbone to link all 


Drop line 


the devices in anetwork, 


Drop line 











Station 








Drop line 








Cable end Cable end 





Tap Tap Tap 
e Nodes are connected to the bus cable by drop lines and taps. 


e A drop line is a connection running between the device and the main cable. 
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A tap is a connector that either splices into the main cable or punctures the sheathing of a 
cable to create a contact with the metallic core. 

As a signal travels along the backbone, some of its energy is transformed into heat. 
Therefore, it becomes weaker and weaker as it travels farther and farther. For this reason 
there is a limit on the number of taps a bus can support and on the distance between those 


taps. 


Advantages: 


Ease of Installation. 
Backbone cable can be laid along the most efficient path, then connected to the nodes by 
drop lines of various lengths. In this way, a bus uses less cabling than mesh or star 


topologies. 


Disadvantage: 


Difficult reconnection and fault isolation. 

A bus is usually designed to be optimally efficient atyinstallation. It can therefore be 
difficult to add new devices. 

Signal reflection at the taps can cause degradation in quality. 

A fault or break in the bus,cable)stops all transmission, even between devices on the same 
side of the problem 

The damaged area reflects signals back in the direction of origin, creating noise in both 


directions. 


4) Ring Topology 





In a ring topology, each device has a dedicated point-to-point connection with only the 
two devices on either side of it. 

A signal is passed along the ring in one direction, from device to device, until it reaches 
its destination. 

Each device in the ring incorporates a repeater. When a device receives a signal intended 


for another device, its repeater regenerates the bits and passes them along 
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Repeater 


Station j 





Repeater 








Advantages: 


A ring is relatively easy to install and reconfigure. 


Each device is linked to only its immediate neighbors: Hence adding or deleting a device 
requires changing only two connections. 

Fault Isolation is simplified. 

If one device does not receive a signal within a specified period) it can issue an alarm. 


The alarm alerts the network operator tothe problem and its location. 


Disadvantages: 


Unidirectional traffic. 


e Ina simple ring, a break in the ring, (suchyas a disabled station) can disable the entire 
network. 
5) Hybrid Topology, 


A network can be)hybrid. For example, we can ‘have a main star topology with each branch 


connecting séyeral stations in a bus topology as shown below. 











Station J Station j Station j 
station Station j Station J 
Station J Station j Station j 


















Hub 
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Network Models 


Categories of Networks 


There are primarily two main categories. 


1) Local Area Network 


A local area network (LAN) is usually privately owned. 

LAN connects the devices in a single office, building, or campus. 

Depending on the needs of an organization and the type of technology.used, a LAN can 
be as simple with just two PCs and a printer; or it canyextend throughout a company and 
include audio and video peripherals. 

LAN size is limited to a few kilometers. 

LANs are designed to allow resources tobe shared between personal computers or 
workstations. The resources to be shared)can include hardware, software, or data. 

The most common LAN topologies are bus, ring, and star. Early LANs had data rates in the 4 
to 16 megabits per second (Mbps),range,, Today, however, speeds are normally 100 or 1000 
Mbps. 




















Hub 









































2) Wide Area Network 


A wide area network (WAN) provides long-distance transmission of data, image, audio, and 


video information over large geographic areas that may comprise a country, a continent, or even 


the whole world. 
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A WAN can be as complex as the backbones that connect the Internet or as simple as a dial-up 


line that connects a home computer to the Internet. 


S J End system 














End system End system 








a. Switched WAN 








Point-to-point 



































==] 
ooo 
— 
Modem Modem —— 


Computer ISP 














b, Point-to-point WAN 


The switched WAN connects the end systems, which usually comprise a router that connects to 
another LAN or WAN, 

The point-to-point WAN jis normally a line leased from a telephone or cable TV provider that 
connects a home computer ora smallLAN)to an Internet service provider (ISP). This type of 


WAN is often used to provide Internet access. 


3) Metropolitan Area, Networks 


A metropolitan area network (MAN) is a network with a size between a LAN and a WAN. 
It normally covers the area inside a town or a city. It is designed for customers who need a high- 
speed connectivity, normally to the Internet, and have endpoints spread over a city or part of city. 


Ex: high-speed DSL line, cable TV network. 
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Interconnection of Networks: Internetwork 
When two or more networks are connected, they become an internetwork, or internet. Below 


figure shows an heterogeneous network made off our WANs and two LANs. 


President 





Point-to-point, 
WAN 








Point-to-point 
= WAN 


Router Switched WAN 





%Point-to-point 





1.3 The Internet 


The internet is a. globalmetwork connecting millions of computers. It is a network of networks. 


Brief History 

e A networkis a.group of connected communicating devices such as computers and printers. 
An internet (note the lowercase letter i) is two or more networks that can communicate with 
each other. 

e The most notable internet is called the Internet (uppercase letter I), a collaboration of more 
than hundreds of thousands of interconnected networks. 

e In the mid-1960s, mainframe computers in research organizations were standalone devices. 


Computers from different manufacturers were unable to communicate with one another. 
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e The Advanced Research Projects Agency (ARPA) in the Department of Defense (DoD) was 
interested in finding a way to connect computers so that the researchers they funded could 
share their findings, thereby reducing costs and eliminating duplication of effort. 

e In 1967, at an Association for Computing Machinery (ACM) meeting, ARPA presented its 
ideas for ARPANET, a small network of connected computers. The idea was that each host 
computer would be attached to a specialized computer, called an interface message processor 
(IMP). The IMPs, in turn, would be connected to one another. Each IMP had to be able to 
communicate with other IMPs as well as with its own attached host. 

* By 1969, ARPANET was a reality. Four nodes, at the University of California at Los 
Angeles (UCLA), the University of California at Santa Barbara (UCSB), Stanford Research 
Institute (SRI), and the University of Utah, were connected via the IMPs to form a network. 
Software called the Network Control Protocol (NCP) provided communication between the 
hosts. 

© In 1972, Vint Cerf and Bob Kahn, both of whomwere part ofthe core ARPANET group, 
collaborated on what they called the Jnternetting Projec!. Cerf and Kahn's landmark 1973 
paper outlined the protocols to achieveyend-to-end delivery of packets. This paper on 
Transmission Control Protocol (TCP) ineluded)concepts)such as encapsulation, the datagram, 
and the functions,of a gateway. 

¢ Authorities made a decision to split, TCP into two protocols: Transmission Control Protocol 
(TCP) and, Internetworking Protocol (IP). IP)would handle datagram routing while TCP 
would be responsible for higher-levelfunctions such as segmentation, reassembly, and error 


detection) The internetworking protocol became known as TCPIIP. 


The Internet Today 

The Internet today is not a simple hierarchical structure. It is made up of many wide- and local- 
area networks joined by connecting devices and switching stations. Today most end users who 
want Internet connection use the services of Internet service providers (ISPs). There are 
international service providers, national service providers, regional service providers, and local 


service providers. The Internet today is run by private companies, not the government. 
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a. Structure of a national ISP 

















b. Interconnection of national ISPs 


International Internet Service Providers 


At the top of the hierarchy are the international service providers that connect nations together. 


National Internet Service Providers 

e The national Internet service proyiders are backbone networks created and maintained by 
specialized companies, 

© To provide connectivity between the end users, these backbone networks are connected by 
complex switching stations called network access points. 

© Some national ISP networks are also connected to one another by private switching stations 


called peering points. 


Regional Internet Service Providers 
Regional internet service providers or regional ISPs are smaller ISPs that are connected to one or 


more national ISPs. They are at the third level of the hierarchy with a smaller data rate. 
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Local Internet Service Providers 
Local Internet service providers provide direct service to the end users. The local ISPs can be 
connected to regional ISPs or directly to national ISPs. Most end users are connected to the local 


ISPs. 


1.4 Protocols and Standards 


Protocols 

¢ In computer networks, communication occurs between/entities in different systems. An entity 
is anything capable of sending or receiving information. However, two entities cannot simply 
send bit streams to each other and expect to be understood. Formcommunication to occur, the 
entities must agree on a protocol. 

e A protocol is a set of rules that govern dataycommunications. A protocol defines what is 


communicated, how it is communicated, and)when it is communicated. 


The key elements of a protocol are syntax, semantics, andtiming. 

e Syntax: The term syntax refers to the structure or format of the data, meaning the order in 
which they are presentedy 
For example, a simple)protocoh might, expect the first 8 bits of data to be the address of the 
sender, the second 8)bits to)be the address of the,receiver, and the rest of the stream to be the 
message itself. 

e Semantics: The word semantics refers to the meaning of each section of bits. How is a 
particular pattern)to be interpreted, and what action is to be taken based on that 
interpretation? 

For example, does an address identify the route to be taken or the final destination of the 
message? 

e Timing: The term timing refers to two characteristics: when data should be sent and how fast 
they can be sent. 

For example, if a sender produces data at 100 Mbps but the receiver can process data at only 


1 Mbps, the transmission will overload the receiver and some data will be lost. 
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Standards 

e Standards are essential in creating and maintaining an open and competitive market for 
equipment manufacturers and in guaranteeing national and international interoperability 
of data and telecommunications technology and processes. 

e Standards provide guidelines to manufacturers, vendors, government agencies, and other 
service providers to ensure the kind of interconnectivity necessary in today's marketplace 
and in international communications. 

e Data communication standards fall into two categories: de facto (meaning "by fact" or 
"by convention") and de jure (meaning "by law" or,"by regulation"). 

= De facto: Standards that have not been approved by an organizedbody but have 
been adopted as standards through widespread use are de facto standards. De 
facto standards are often established originally by manufacturers who seek to 
define the functionality of anew product or technology. 

= De jure: Those standards that have been legislated by an officially recognized 


body are de jure standards. 


Standards Organizations 


Standards are developed through the)cooperation of standards creation committees, forums, and 


governmentregulatory agencies. 


Standards, Creation Committees: 
1) International Organization for Standardization (ISO): 
e The ISO is a multinational body whose membership is drawn mainly from the standards 
creation committees of various governments throughout the world. 
¢ The ISO is active in developing cooperation in the realms of scientific, technological, and 


economic activity. 


2) International Telecommunication Union-Telecommunication Standards Sector (ITU-T): 
e In 1970s the United Nations formed a committee called the Consultative Committee for 
International Telegraphy and Telephony (CCITT), as part of its International 


Telecommunication Union (ITU). 
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e This committee was devoted to the research and establishment of standards for 
telecommunications in general and for phone and data systems in particular. 
e On March 1, 1993, the name of this committee was changed to the International 


Telecommunication Union Telecommunication Standards Sector (ITU-T). 


3) American National Standards Institute (ANSD: 


The American National Standards Institute is a completely private, nonprofit corporation. 


4) Institute of Electrical and Electronics Engineers (IEEE): 

e The Institute of Electrical and Electronics Engineers is the largest professional 
engineering society in the world. 

e International in scope, it aims to advance theory, creativity, and product quality in the 
fields of electrical engineering, electronics, and radio as well as injall related branches of 
engineering. 

e As one of its goals, the IEEE oversees the development and adoption of international 
standards for computing and communications. 

. 

5) Electronic Industries Association (EIA): 

e The Electronic Industries, Association is a nonprofit organization devoted to the 
promotion ofjelectronics manufacturing concerns. 

¢ «Its_activities include, public awareness education and lobbying efforts in addition to 


standards development. 


Forums: 

Standards committees are procedural bodies and by nature slow-moving. To accommodate the 
need for working models and agreements and to facilitate the standardization process, many 
special-interest groups have developed forums made up of representatives from interested 
corporations. The forums work with universities and users to test, evaluate, and standardize new 


technologies. 
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Regulatory Agencies: 

All communications technology is subject to regulation by government agencies such as the 
Federal Communications Commission (FCC) in the United States. The purpose of these 
agencies is to protect the public interest by regulating radio, television, and wire/cable 


communications. 


Internet Standards 

e An Internet standard is a thoroughly tested specification that is useful to and adhered to by 
those who work with the Internet. 

* It is a formalized regulation that must be followed. There is a strict procedure by which a 
specification attains Internet standard status. 

* A specification begins as an Internet draft. An Internetdraft is aworking document (a work in 
progress) with no official status anda,6-month lifetime. 

* Upon recommendation from the Internet authorities, a draft may be published as a Request for 
Comment (RFC). 

* Each RFC is edited, assigned a number, and made available to all interested parties. RFCs go 


through maturity level§,and are categorized according to their requirement level. 


1.5 Layered Tasks 


Consider théexample of postal mail, 


Sender Receiver 
2 2 


f f 


Higher layers 


Middle layers 


Lower layers 


The parcel is carried from 
the source to the destination. 
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At the Sender Site 
e Higher layer. The sender writes the letter, inserts the letter in an envelope, writes the 
sender and receiver addresses, and drops the letter in a mailbox. 
e Middle layer. The letter is picked up by a letter carrier and delivered to the post office. 


e Lower layer. The letter is sorted at the post office; a carrier transports the letter. 


On the Way 
The letter is then on its way to the recipient. On the way to theecipient's local post office, the 
letter may actually go through a central office. In addition, it may be transported by truck, train, 


airplane, boat, or a combination of these. 


At the Receiver Site 
¢ Lower layer. The carrier transports'the letter to the post office. 
e Middle layer. The letter is sorted and delivered to the recipient's mailbox. 


e Higher layer. The receiver picks up'the letter, opens the envelope, and reads it. 


The task of transporting the letter between the sender and the receiver is done by the carrier. 

At the sender site, the letter must be written and,dropped in the mailbox before being picked up 
by the letter carrier and delivered)to the post office. 

At the receiver, site, the lettermust be dropped in)the recipient mailbox before being picked up 
and readby the recipient. 

Each layer at the sending site uses the services of the layer immediately below it. The sender at 
the higher layer usés,the services of the middle layer. The middle layer uses the services of the 


lower layer. The lower layer uses the services of the carrier. 


1.6 The OSI Model 


e An ISO standard that covers all aspects of network communications is the Open Systems 
Interconnection model. 

¢ It was first introduced in the late 1970s. 

e An open system is a set of protocols that allows any two different systems to communicate 


regardless of their underlying architecture. 
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© The purpose of the OSI model is to show how to facilitate communication between different 
systems without requiring changes to the logic of the underlying hardware and software. 

e The OSI model is a layered framework for the design of network systems that allows 
communication between all types of computer systems. 


e It consists of 7 following dedicated Layers. 

















7 Application | 
6 Presentation. | 
5 Session | 
See) 
3 Network | 
se 
aaa) 


Layered Architecture 

¢ The OSI model is composed ofjseven)ordered layers: )physical (layer 1), data link (layer 2), 
network (layer 3), transport (layer 4)psession (layer 5), presentation (layer 6), and application 
(layer 7). 


¢ Below Figure shows the layers involved whema message is sent from device A to device B. 


Device Device 
A B 








Intermediate Intermediate 
node node 


= — 
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e As the message travels from A to B, it may pass through many intermediate nodes. These 
intermediate nodes usually involve only the first three layers of the OSI model. 

e In developing the model, the designers identified which networking functions had related 
uses and collected those functions into discrete groups that became the layers. 

e Each layer defines a family of functions distinct from those of the other layers. By defining 
and localizing functionality in this fashion, the designers created an architecture that is both 
comprehensive and flexible. 

¢ Within a single machine, each layer calls upon the services of the layer just below it. 

e Layer 3, for example, uses the services provided by layer 2 and provides services for layer 4. 
This communication is governed by an agreed-upon series of rules and)conyentions called 
protocols. 

© The processes on each machine that communicate at agiven layer are called peer-to-peer 
processes. Communication between machines is therefore a peer-to-peer process using the 


protocols appropriate to a given layer. 


Peer-to-Peer Processes, 

e At the physical layer, communication is direct. 

e At the higher layers, however, communication\must move down through the layers on device 
A, over'to device B, and then back up'through the layers. 

e Each layer inthe sending device adds its own information to the message it receives from the 
layer just above it and passes the whole package to the layer just below it. 

e At layer.! the,entire package\is converted to a form that can be transmitted to the receiving 
device. 

e At the receiving machine, the message is unwrapped layer by layer, with each process 
receiving and removing the data meant for it. 

e For example, layer 2 removes the data meant for it, then passes the rest to layer 3. Layer 3 


then removes the data meant for it and passes the rest to layer 4, and so on. 
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Interfaces between Layers 

e The passing of the data and network information down through the layers of the sending 
device and back up through the layers of the receiving device is made possible by an 
interface between each pair of adjacent layers. 

e Each interface defines the information and services a layer must provide for the layer above 


it. Well-defined interfaces and layer functions provide modularity to a network. 


Organization of the Layers 

e Layers 1, 2, and 3-physical, data link, and network-aré,the network support layers; they deal 
with the physical aspects of moving data from.one device to another (such as electrical 
specifications, physical connections, physical addressing, and transport timing and 
reliability). 

¢ Layers 5, 6, and 7-session, presentation, and application-can be thought of as the user support 
layers; they allow interoperability among unrelated software systems. 

e Layer 4, the transport layer, links the two subgroups and,ensures that what the lower layers 
have transmitted is in aform that the upper layers camuse. 

e The upper OSI layers are almost,always implemented in software; lower layers are a 
combination of hardware. and software, except for the physical layer, which is mostly 
hardware, 


Below figure gives an overall view of the OSI layers, here D7 means the data unit at layer 7, D6 


means thedata unit at layer 6)and So on. 
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© The process starts at layer 7 (the application layer), then moves from layer to layer in 
descending, sequential order. 

e At each layer, a header, or possibly a trailer, can be added to the data unit. Commonly, the 
trailer is added only at layer 2. 

e When the formatted data unit passes through the physical layer (layer 1), it is changed into an 
electromagnetic signal and transported along a physical link. 

© Upon reaching its destination, the signal passes into layer 1 and is transformed back into 
digital form. 

¢ The data units then move back up through the OSI layers. 

e As each block of data reaches the next higher layer, the headers and trailers attached to it at 
the corresponding sending layer are removed, and actions appropriate to that layer are taken. 

¢ By the time it reaches layer 7, the message is again in aform appropriate to the application 


and is made available to the recipient. 


Encapsulation 

A packet (header and data)yat level 7 is encapsulated in/a packet at level 6. The whole packet at 
level 6 is encapsulated in a packet atlevel 5, and so on. In other words, the data portion of a 
packet at level N - 1 carries. the whole packet (data and header and maybe trailer) from level N. 
The concept is called encapsulation; level N=1 is not aware of which part of the encapsulated 
packet is data and which partis the header or trailer. For level N - 1, the whole packet coming 


from level.N is treated as oneintegral unit. 


1.7 Layersin OSI Model 





e The physical layer coordinates the functions required to carry a bit stream over a physical medium. 
e It deals with the mechanical and electrical specifications of the interface and transmission medium. 
e It also defines the procedures and functions that physical devices and interfaces have to perform for 


transmission to Occur. 
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From data link layer To data link layer 


Physical Physical 
layer layer 





Transmission medium 


6) 


The physical layer is also concerned with the following: 


1. Physical characteristics of interfaces and mediu 





: The physical layer defines the characteristics 
of the interface between the devices and the transmission medium. It also defines, the type of 
transmission medium. 

2. Representation of bits: The physical layer data consists of a stream of bits,(sequence of Os or Is) 
with no interpretation. In order to transmit these bitspit must be encoded into electrical or optical 
signals. The physical layer defines, the type of encoding (how. Os and | s are changed to signals). 

3. Data rate: The transmission rate-the numberof bits sent each second-is also defined by the physical 
layer. In other words, the physical layer defines the duration,of a bit, which is how long it lasts. 

4. Synchronization of bit: The sender and receiver not only,must use the same bit rate but also must be 
synchronized at the bit leyel.In other, words, the sender and the receiver clocks must be synchronized. 

5. Line configuration: The physical layer.is concerned with the connection of devices to the media: 
Point — to ~ point.configuration or multipointconfiguration. 

6. Physical topology: The physical. topology defines how devices are connected to make a network. 
Devices canbe connected by using a mesh topology, a star, a ring topology, a bus topology, or a 


hybrid topology. 





7. Transmission mode: The physical layer also defines the direction of transmission between two 


devices: simplex, half-duplex, or full-duplex. 


Data Link Layer 


The data link layer is responsible for moving frames from one hop (node) to the next. 
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From network layer To network layer 














Data link layer Data link layer 


To physical layer From physical layer 


Other responsibilities of the data link layer include the following: 
1. Framing: The data link layer divides the stream of bits received from the network layer into 


manageable data units called frames. 


Nv 


Physical addressing: If frames are to,be distributed to different systems on the network, the 

data link layer adds a header to the frame'to define the sender and/or receiver of the frame. If 

the frame is intended for a system outside the sender's network, the receiver address is the 
address of the device that connects the network to the nextione. 

3. Flow control: If the rate at which the data aréabsorbed by the receiver is less than the rate at 
which data are produced in the sender, the data link layer imposes a flow control mechanism 
to avoid overwhelming théreceiver, 

4. Error control: The data link, layer, adds) reliability to the physical layer by adding 
mechanisms to detectyand retransmit damaged or lost frames. It also uses a mechanism to 
recognize duplicate frames. Error control is normally achieved through a trailer added to the 
end of the frame. 

5. Access control: When two or more devices are connected to the same link, data link layer 

protocols are necessary to determine which device has control over the link at any given 


time. 


Below figure illustrates hop-to-hop (node-to-node) delivery by the data link layer. 
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As the figure shows, communication atthe data link layer occurs between two adjacent nodes. To 
send data from A to F)three partial deliveries are made. First, the data link layer at A sends a frame 
to the data link layer at B (a router), Second, the,data link layer at B sends a new frame to the data 
link layer at E. Finallypthe datalink layer at E sends anew frame to the data link layer at F. 

The frame from A to B has B as the destination address and A as the source address. The frame 
from B to E has E as the destination address and B as the source address. The frame from E to F 
has F as the destination address/and E as the source address. The values of the trailers can also be 


different if error checking includes the header of the frame. 


Network Layer 

The network layer is responsible for the source-to-destination delivery of a packet, possibly 
across multiple networks (links). Whereas the data link layer oversees the delivery of the packet 
between two systems on the same network (links), the network layer ensures that each packet 


gets from its point of origin to its final destination. 
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From transport layer To transport layer 
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Network 
layer 





To data link layer From data link layer 


Other responsibilities of the network layer include the following: 

1. Logical addressing: The physical addressing implemented by the data link layer handles the 
addressing problem locally. If a packet passes the network boundary, we need another 
addressing system to help distinguish)the source and destination systems. The network layer 
adds a header to the packet coming from)the upper layer that, among other things, includes 
the logical addresses of the sender and.receiver. 

2. Routing: When independent networks )or links, are connected to create internetworks 
(network of networks) or a largenetwork, the)connecting devices (called routers or switches) 
route or switch)the packets to, their)final destination. One of the functions of the network 


layer is to provide this mechanism) 


Below figure illustrates end-to-end delivery by the network layer. 


The network layer,at A sends the packet to the network layer at B. When the packet arrives at 
router B, the router makes a decision based on the final destination (F) of the packet. Router B 
uses its routing table to find that the next hop is router E. The network layer at B, therefore, 
sends the packet to the network layer at E. The network layer at E, in tum, sends the packet to the 


network layer at F. 
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Transport Layer 


The transport layer.is responsible for process-to-process delivery of the entire message. 

A process is an)application program running on a host. 

Whereas the network layer oversees source-to-destination delivery of individual packets, it does 
not recognize any relationship between those packets. It treats each one independently, as though 
each piece belonged to a separate message, whether or not it does. 

The transport layer, on the other hand, ensures that the whole message arrives intact and in order, 


overseeing both error control and flow control at the source-to-destination level. 
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Transport Transport 
layer To network layer From network layer layer 


Other responsibilities of the transport layer include the following: 

1. Service-point addressing: Computers often run several programs at the same time. For this 
reason, source-to-destination delivery means delivery notonly from one computer to the next 
but also from a specific process (running program) on oné,computer,to a specific process 
(running program) on the other. The transport layer header must,therefore include a type of 
address called a service-point address (or/port address),The network layer gets each packet 
to the correct computer; the transport layer gets the,entire message to the correct process on 
that computer. 

2. Segmentation and reassembly: A message is divided into transmittable segments, with each 
segment containing a sequence number» These numbers enable the transport layer to 
reassemblé,the message correctly upon arriving, at the destination and to identify and replace 
packets that were lost imjtransmission, 

3. Connection control: The transport layer can be either connectionless or connection oriented. 
A connectionless,transport layer treats each segment as an independent packet and delivers it 
to the transport layer at.the destination machine. A connection oriented transport layer makes 
a connection with the transport layer at the destination machine first before delivering the 
packets. After all the data are transferred, the connection is terminated. 

4. Flow control: Like the data link layer, the transport layer is responsible for flow control. 
However, flow control at this layer is performed end to end rather than across a single link. 

5. Error control: Like the data link layer, the transport layer is responsible for error control. 
However, error control at this layer is performed process-to process rather than across a 


single link. The sending transport layer makes sure that the entire message arrives at the 
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receiving transport layer without error (damage, loss, or duplication). Error correction is 
usually achieved through retransmission. 


Below figure illustrates process-to-process delivery by the transport layer. 
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Session Layer 
The session layer is the network dialog controller, It establishes, maintains, and synchronizes the 


interaction among communicating systems. 


From presentation layer To presentation layer 














Session 
layer 





To transport layer From transport layer 


Specific responsibilities of the session layer include the following: 
1. Dialog control: The session layer allows two systems to enter into a dialog. It allows the 
communication between two processes to take place in either half duplex or full-duplex 


mode. 
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2. Synchronization: The session layer allows a process to add checkpoints, or synchronization 
points, to a stream of data. 
For example, if a system is sending a file of 2000 pages, it is advisable to insert checkpoints 
after every 100 pages to ensure that each 100-page unit is received and acknowledged 
independently. In this case, if a crash happens during the transmission of page 523, the only 
pages that need to be resent after system recovery are pages 501 to 523. Pages previous to 


501 need not be resent. 


Presentation Layer 


The presentation layer is concerned with the syntax and semantics of the information exchanged 


between two systems. 


From application layer To application layer 


Presentation 
layer 











Presentation 
layer 


To session layer From session layer 


Specifieresponsibilities of the presentation layer include the following: 

1. Translation: The processes (running programs) in two systems are usually exchanging 
information in the formpof.character strings, numbers, and so on. The information must be 
changed to bit streams, before being transmitted. Because different computers use different 
encoding systems, the presentation layer is responsible for interoperability between these 
different encoding methods. The presentation layer at the sender changes the information 
from its sender-dependent format into a common format. The presentation layer at the 


receiving machine changes the common format into its receiver-dependent format. 


Nv 


Encryption: To carry sensitive information, a system must be able to ensure privacy. 


Encryption means that the sender transforms the original information to another form and 
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sends the resulting message out over the network. Decryption reverses the original process to 
transform the message back to its original form. 

3. Compression: Data compression reduces the number of bits contained in the information. 
Data compression becomes particularly important in the transmission of multimedia such as 


text, audio, and video. 


Application Layer 


The application layer enables the user to access the network. It provides user interfaces and 
support for services such as electronic mail, remote file access and transfer, shared database 
management, and other types of distributed informatiom)services. 

Of the many application services available, the figure shows only, three: XAOO (message- 
handling services), X.500 (directory setyices), and file transfer, access, and management 


(FTAM). 


User User 
(human or program) (human or program) 


= ae 


——— pened 





Message 











Application Application 
layer layer 


To presentation layer From presentation layer 
Specific services provided by the application layer include the following: 
1. Network virtual terminal: A network virtual terminal is a software version of a physical 
terminal, and it allows a user to log on to a remote host. To do so, the application creates a 


software emulation of a terminal at the remote host. The user's computer talks to the software 
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terminal which, in turn, talks to the host, and vice versa. The remote host believes it is 
communicating with one of its own terminals and allows the user to log on. 

2. File transfer, access, and management: This application allows a user to access files in a 
remote host (to make changes or read data), to retrieve files from a remote computer for use 
in the local computer, and to manage or control files in a remote computer locally. 

3. Mail services: This application provides the basis for e-mail forwarding and storage. 

4. Directory services: This application provides distributed database sources and access for 


global information about various objects and services. 


Summary of Layers 





To allow access to network 
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To translate, encrypt, and = a 
compress data 
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; : terminate sessions 
To provide reliable process-to- 


process message delivery and 
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To organize bits into frames; 
to provide hop-to-hop delivery 











To transmit bits over a medium; 
to provide mechanical and 
electrical specifications 














1.8 TCP/IP Protocol Suite 


The original TCP/IP protocol suite was defined as having four layers: host-to-network, internet, 
transport, and application. However, when TCP/IP is compared to OSI, we can say that the host- 
to-network layer is equivalent to the combination of the physical and data link layers. The 
internet layer is equivalent to the network layer, and the application layer is roughly doing the 
job of the session, presentation, and application layers with the transport layer in TCP/IP taking 


care of part of the duties of the session layer. 
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Physical and Data Link.Layers 
At the physical and,data link layers, TCP/IP does not define any specific protocol. It supports all 
the standard and proprietary protocols. A networkin a TCP/IP internetwork can be a local-area 


network or ‘@ywvide-area network. 


Network Layer 

At the network layer (or;.more accurately, the internetwork layer), TCP/IP supports the 
Internetworking ProtocolIP, in turn, uses four supporting protocols: ARP, RARP, ICMP, and 
IGMP. 


Internetworking Protocol (IP) 
¢ The Internetworking Protocol (IP) is the transmission mechanism used by the TCP/IP 
protocols. 


e  Itis an unreliable and connectionless protocol-a best-effort delivery service. 
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e The term best effort means that IP provides no error checking or tracking. IP assumes the 
unreliability of the underlying layers and does its best to get a transmission through to its 
destination, but with no guarantees. 

e IP transports data in packets called datagrams, each of which is transported separately. 

e Datagrams can travel along different routes and can arrive out of sequence or be duplicated. 
IP does not keep track of the routes and has no facility for reordering datagrams once they 


arrive at their destination. 


Address Resolution Protocol (ARP) 
© The Address Resolution Protocol (ARP) is used to associate a logical address)with a physical 
address. 


e ARP is used to find the physical address of the node when‘its Internet address is known. 


Reverse Address Resolution Protocol (RARP) 

© The Reverse Address Resolution Protocol (RARP) allows a host to discover its Internet 
address when it knows only its physical address. 

e It is used when a computer is connected.to a network for the first time or when a diskless 


computer is booted. 


Internet Contrel Message Protocol (ICMP) 
The Internet Control Message Protocol (ICMP) is a mechanism used by hosts and gateways to 
send notification of datagram problems back to the sender. ICMP sends query and error reporting 


messages. 


Internet Group Message Protocol (IGMP) 
The Internet Group Message Protocol (IGMP) is used to facilitate the simultaneous transmission 


of a message to a group of recipients. 


Transport Layer 


Traditionally the transport layer was represented in TCP/IP by two protocols: TCP and UDP. 
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IP is a host-to-host protocol, meaning that it can deliver a packet from one physical device to 
another. UDP and TCP are transport level protocols responsible for delivery of a message from a 
process (running program) to another process. A new transport layer protocol, SCTP, has been 


devised to meet the needs of some newer applications. 


User Datagram Protocol (UDP) 

¢ The User Datagram Protocol (UDP) is the simpler of the two standard TCP/IP transport 
protocols. 

¢ It is a process-to-process protocol that adds only port addresses, checksum. error control, and 


length information to the data from the upper layer. 


Transmission Control Protocol (TCP) 

¢ The Transmission Control Protocoh (TCP) provides full transport-layer services to 
applications. 

e TCP is a reliable stream transport protocol»The term stream, in this context, means 
connection-oriented: A_connection‘must be. established between both ends of a transmission 
before either can transmit data. 

e At the sending end of each transmission, TCP divides a stream of data into smaller units 
called segments. Each segment includes)a sequence number for reordering after receipt, 
together with an acknowledgmentnumber for the segments received. 

e Segments are carried across the internet inside of IP datagrams. At the receiving end, TCP 
collects each datagram as it comes in and reorders the transmission based on sequence 


numbers. 


Stream Control Transmission Protocol (STCP) 
The Stream Control Transmission Protocol (SCTP) provides support for newer applications such 
as voice over the Internet. It is a transport layer protocol that combines the best features of UDP 


and TCP. 
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Application Layer 


The application layer in TCP/IP is equivalent to the combined session, presentation, and 
application layers in the OSI model. Many protocols are defined at this layer like HTTP, FTP, 
TELNET ete. 


1.9 Addressing 


Four levels of addresses are used in an internet employing the TCP/IP protocols: 
e physical (link) addresses 
e logical (IP) addresses 
© port addresses 


© specific addresses 


Relationship of layers and addresses in TCP/IP is shown below: 





Application layer 


Transport layer 


Network layer 


other protocols 


Data link layer 


Physical layer 
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Physical Addresses 

e The physical address, also known as the link address, is the address of a node as defined by 
its LAN or WAN. 

e It is included in the frame used by the data link layer. It is the lowest-level address. 

e The size and format of these addresses vary depending on the network. 

e For example, Ethernet uses a 6-byte (48-bit) physical address that is imprinted on the 


network interface card (NIC). 








Example: 
In the Figure a node with physical address 10 sends a frame to,a node with physical address 87. 
The two nodes are connected by a link (bus topology LAN). As the figure shows, the computer 


with physical address 10 is the sender, and the computer with physical address 87 is the receiver. 
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address Destination address does 














not match; the packet is 
dropped 








Source 
address LAN 





Most local-area networks use a48-bit,(6-byte) physical address written as 12 hexadecimal digits; 
every byte (2, hexadecimalidigits) is separated by a colon, as: 07:01:02:01:2C:4B 


Logical Addresses 

e Logical addresses are necessary for universal communications that are independent of 
underlying physical networks. 

e Physical addresses are not adequate in an internetwork environment where different networks 
can have different address formats. 

e A universal addressing system is needed in which each host can be identified uniquely, 


regardless of the underlying physical network. 
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e A logical address in the Internet is currently a 32-bit address that can uniquely define a host 
connected to the Internet. No two publicly addressed and visible hosts on the Internet can 


have the same IP address. 





Example: 


Figure shows a part of an internet with two routers connecting three LANs. Each device 
(computer or router) has a pair of addresses (logical and physical) foreach connection. In this 
case, each computer is connected to only one link and therefore;has only one pair of addresses. 
Each router, however, is connected to three networks (only two are shown in the figure). So each 


router has three pairs of addresses, one for each connection. 
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Port Addresses 


e The IP address and the physical address are necessary for a quantity of data to travel from a 
source to the destination host. However, arrival at the destination host is not the final 
objective of data communications on the Internet. A system that sends nothing but data from 
one computer to another is not complete. 

e Today, computers are devices that can run multiple processes atthe same time. The end 
objective of Internet communication is a process communicating with another process. 

e For example, computer A can communicate with computer C byusing TELNET. At the same 
time, computer A communicates with computer B by using the File Transfer Protocol (FTP). 
For these processes to receive data simultaneously, we need.a method to label, the different 
processes. 

e In other words, they need addresses, In the TCP/IP architecture, the label assigned to a 
process is called a port address. A port address in) TCP/IP is 16 bits in length. 


e A port address is a 16-bit address represented by one decimal number 








Example: 


Figure shows two computers communicating.via the Internet. The sending computer is running 
three processes at this time with port addresses a, b, and c. The receiving computer is running 
two processes at this time with port addresses j and k, Process a in the sending computer needs to 
communicate. with process j in the receiving computer. Note that although physical addresses 
change from hop»to hop, logical and port addresses remain the same from the source to 


destination. 
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Specific Addresses 


Some applications have user-friendly addresses thatare designed for that specific address. 


Example: e-mail address and the Universal Resource Locator (URL) 


PHYSICAL LAYER 


¢ One of the major functions of the physical layer is to move data in the form of 
electromagnetic signals across a transmission medium. 


© To be transmitted, data must be transformed to electromagnetic signals. 


2.1 Analog and Digital 
Analog and Digital Data 


e Data can be analog or digital. The term analog data refers to information that is continuous; 
digital data refers to information that has discrete states. 

¢ For example, an analog clock that has hour, minute, and second hands gives information in a 
continuous form; the movements of the hands are continuous. 


e On the other hand, a digital clock that reports the hours and the minutes will change suddenly 


from 8:05 to 8:06. 
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e Analog data, such as the sounds made by a human voice, take on continuous values. When 
someone speaks, an analog wave is created in the air. This can be captured by a microphone 
and converted to an analog signal or sampled and converted to a digital signal. 

e Digital data take on discrete values. For example, data are stored in computer memory in the 
form of Os and 1s. They can be converted to a digital signal or modulated into an analog 


signal for transmission across a medium. 


Analog and Digital Signals 

e Like the data they represent, signals can be either analog or digital. 

e An analog signal has infinitely many levels of intensity over a period of time. As the wave 
moves from value A to value B, it passes through and ineludes an infinite number of values 
along its path. 

e A digital signal, on the other hand, can have only a)limited number of defined values. 


Although each value can be any number, it is often as simple as 1 and O» 





Value Value 


ry 


Saye . 
Time tl Time 


a. Analog signal b. Digital signal 























Periodic and Nonperiodie Signals 

¢ Both analog and digital signals can take one of two forms: periodic or nonperiodic 

e A periodic signal completes a pattern within a measurable time frame, called a period, and 
repeats that pattern over subsequent identical periods. 

e The completion of one full pattern is called a cycle. 

e A nonperiodic signal changes without exhibiting a pattern or cycle that repeats over time. 

e In data communications, we commonly use periodic analog signals and nonperiodic digital 


signals. 


3.2 Periodic Analog Signals 


e Periodic analog signals can be classified as simple or composite. 
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e A simple periodic analog signal is a sine wave which cannot be decomposed into simpler 


signals. 


e A composite periodic analog signal is composed of multiple sine waves. 


Sine Wave 


© The sine wave is the most fundamental form of a periodic analog signal. 
e Sine wave is a simple oscillating curve, its change over the course of a cycle is smooth and 
consistent, a continuous, rolling flow. 


eA sine wave can be represented by three parameters: the peak amplitude, the frequency, and 


the phase. 


Value 





—_ 
Time 


1) Peak amplitude 
The peak amplitude of a signal is theyabsolute value of its highest intensity, proportional to the 


energy it carries. Forelectric signals, peak\amplitude is normally measured in volts. 




















Amplitude 
aes _ 
Time 
a. A signal with high peak amplitude 
Amplitude 
A 
wee = 
Time 











b. A signal with low peak amplitude 


2) Period and Frequency 
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e Period refers to the amount of time, in seconds, a signal needs to complete 1 cycle. 

e Frequency refers to the number of periods in one second. 

e Period is the inverse of frequency, and frequency is the inverse of period, as the following 
formulas show. 


e Period is formally expressed in seconds. Frequency is formally expressed in Hertz (Hz), 


which is cycle per second. 
























































Amplitude 
12 periods in 1s ——» Frequency is 12 Hz 
1s 
sees 
Time 
Period: s 
a. A signal with a frequency of 12 Hz 
Amplitude 
6 periods in 1 s ——®* Frequency is 6 Hz 
ds oe 
Sete Ue 
i Time 
zi 
Period: 3 s 
b. A signal with a frequency of 6 Hz 
Units of period and frequency are shown below: 
Unit Equivalent Unit Equivalent 

Seconds (s) Is Hertz (Hz) 1 Hz 
Milliseconds (ms) 103s Kilohertz (kHz) 108 Hz 
Microseconds ({ts) 10°%s Megahertz (MHz) 10° Hz 
Nanoseconds (ns) 107 s Gigahertz (GHz) 10° Hz 
Picoseconds (ps) 10! s Terahertz (THz) 10!? Hz 























Example: 


1) The power we use at home has a frequency of 60 Hz. The period of this sine wave can be 


determined as follows: 
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2) Express a period of 100 ms in microseconds. 


Solution: 





3) The period of a signal is 100 ms. What is its frequency in kilohertz? 
Solution: 
First we change 100 ms to seconds, and then we calculate the frequency from the period (1 Hz = 


10° kHz). 


100 ms = 100 x 103s =107!s 








; Hz = 10 Hz = 10x 10°? kHz = 10°? kHz 








Frequency is the rate of change with respect to time. Change in a short span of time means high 
frequency. Change over a long spanjof time.means low frequency. 
If a signal does not,change at all)its frequency is zero. If a signal changes instantaneously, its 


frequency is infinite. 


3) Phase 

© The termphase describes the position of the waveform relative to time 0. 

¢ If we think of the wave.as something that can be shifted backward or forward along the time 
axis, phase deseribes}the amount of that shift. It indicates the status of the first cycle. 

¢ Phase is measured in degrees or radians. 

e A phase shift of 360° corresponds to a shift of a complete period; a phase shift of 180° 
corresponds to a shift of one-half of a period; and a phase shift of 90° corresponds to a shift 


of one-quarter of a period. 
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WAT 1 Time 


b. 90 degrees 








a. O degrees 














A 


lls sia 
- | Time 
aT INS 


¢. 180 degrees 

















Looking at above Figure, we can say that 

e A sine wave with a phase of 0° starts atytime O with a zero amplitude. The amplitude is 
increasing. 

e A sine wave with a phase of 90° starts at.time 0 with a peak amplitude. The amplitude is 
decreasing. 

e A sine wave with a phase of 180° starts at time 0 with a zero amplitude. The amplitude is 
decreasing. 

Another way to look at the phase is imterms)of shift or offset. We can say that 

e A sine wave with a phase of 0° is not shifted. 

e A sine waye with a phase of 90%is shifted to the left by 1/4 cycle. However, note that the 
signal does not really exist,before time 0. 

e A sine wavewith a phase of 180° is shifted to the left by 1/2 cycle. However, note that the 


signal does not really exist before time 0. 


Wavelength 

© Wavelength binds the period or the frequency of a simple sine wave to the propagation speed 
of the medium. 

e The wavelength depends on both the frequency and the medium. Wavelength is a property of 
any type of signal. 
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© In data communications, we often use wavelength to describe the transmission of light in an 
optical fiber. 
e The wavelength is the distance a simple signal can travel in one period. 
e Since period and frequency are related to each other, if we represent wavelength by A, 
propagation speed by c (speed of light), and frequency by], we get 
Wavelength =propagation speed x period = Propogation Speed/Frequency 
Wavelength 


Transmission medium Dv fhyPo f 
Attimet 1 


1 

T 

1 
i] a 7 
1 1 Direction of 
' 1 

1 

I 


secs : ti 
Transmission medium a a propagation 
Attimet+T 


e The propagation speed of electromagnetic signals depends on the medium and on the 











frequency of the signal. 
For example, in a vacuum, light is\propagated witha speed of 3x 10° m/s. That speed is 


lower in air and even lowerin cable. 


Time and Frequency Domai 

A sine wave is comprehensively. defined by its amplitude, frequency, and phase. 

The time-domain plot shows changes in signal amplitude with respect to time (it is an amplitude- 
versus-time plot))Phase isnot explicitlyshown on a time-domain plot. 

To show the,relationship between amplitude and frequency, we can use frequency-domain plot. 
A frequency-domain plots concerned with only the peak value and the frequency. Changes of 


amplitude during one period are not shown. 
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Amplitude 
A Frequency: 6 Hz F 
1 
1 
' 
tose ie 
Time 
©) 
a. A sine wave in the time domain (peak value: 5 V, frequency: 6 Hz) 
Amplitude 
> 
Frequency 
(Hz) 











b. The same sine wave in the frequency domain (peak value: 5 V, frequency: 6 Hz) 


A complete sine wave in the time domain can be represented by one single spike in the 


frequency domain. 


Composite Signa 


¢ Accomposite signal is made of many,simple sine wayes. 





e A single frequency sine wave’ isnot useful in data communications; we need to send a 
composite signal,.a signal made,of many simple sine waves. 

e According to Fourier,analysis, any\composite signal is a combination of simple sine waves 
with different frequencies, amplitudes, and phases. 

e A composite signal can be periodic ornonperiodic. 

¢ A periodi¢,composite signal can)be decomposed into a series of simple sine waves with 
discrete frequencies-frequencies that have integer values (1, 2, 3, and so on). 

« A nonperiodi¢ composite signal can be decomposed into a combination of an infinite number 


of simple sine waves with continuous frequencies, frequencies that have real values. 
Bandwidth 


e The range of frequencies contained in a composite signal is its bandwidth. The bandwidth is 
normally a difference between two numbers. 
For example, if a composite signal contains frequencies between 1000 and 5000, its 


bandwidth is 5000 - 1000, or 4000. 
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Amplitude 


1000 


Bandwidth = 5000 - 1000 = 4000 Hz 





5000 Frequency 








a. Bandwidth of a periodic signal 





Amplitude 





1000 
Bandwidth = 5000 - 1000 = 4000 Hz 


_ 
5000 Frequency 














b. Bandwidth of a nonperiodic signal 


2.3 Digital Signals 


In addition to being represented by)an analog signal, information can also be represented by a 


digital signal. For example,.a 1 can be’encoded as a positive voltage and a 0 as zero voltage. A 


digital signal can have more than two,levels. In this case, we can send more than | bit for each 


level. If a signal has Z levels, each level needs Jog2L bits. 
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Amplitude 8 bits sent in 1 s, 
a Bit rate = 8 bps 





Level 2 






Level 1 4 





a. A digital signal with two levels 

















Amplitude 
Ah 16 bits sent in 1 s, 
Bit rate — 16 bps 
My 105 OL + OL} 00 + 00; OO; 10 4 
er Bs wage Mace es ei Hies ONE sect ear HEP 
H H i i I H i 
Level 3 ' ' H f 
H f ' ' H . 
H > 
ae j A A ' t 1s Time 
7 1 1 1 f 1 
Level | Hl ' H r t 
H H ' i i 











b. A digital signal with four levels 


Bit Rate 


The bit rate is the number of bits.sent in 1s, expressed inbits per second. 


Example: 
1) A digital signal haseight levels. How many bits are needed per level? 


Solution: 


Number of bits per level = log, 8 


2) Assume we need to download text documents at the rate of 100 pages per minute. What 
is the required bit rate of the channel? 

Solution: 

A page is an average of 24 lines with 80 characters in each line. If we assume that one character 


requires 8 bits, the bit rate is 


100 x 24 x 80 x 8 = 1,636,000 bps = 1.636 Mbps 
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3) A digitized voice channel, as we will see in Chapter 4, is made by digitizing a 4-kHz 
bandwidth analog voice signal. We need to sample the signal at twice the highest frequency 
(two samples per hertz). We assume that each sample requires 8 bits. What is the required 
bit rate? 
Solution: 


Bit rate is: 


2 x 4000 x 8 = 64,000 bps = 64 kbps 














Bit Length 
The bit length is the distance one bit occupies on the transmission medium. 


Bit length =propagation speed x bit duration) 


Transmission of Digital Signals 
I) Baseband Transmission 
Baseband transmission means sending )a digital signah over a channel without changing the 


digital signal to an analog signal. 


ae ei Sao 


Digital signal 


— EO 


Channel 





























Baseband transmissionrequires that we have a low-pass channel, a channel with a bandwidth 
that starts from zero. This’ is the case if we have a dedicated medium with a bandwidth 
constituting only oné\channel. 

For example, the entire bandwidth of a cable connecting two computers is one single channel. 
Baseband transmission of a digital signal that preserves the shape of the digital signal is possible 
only if we have a low-pass channel with an infinite or very wide bandwidth. 

In baseband transmission, the required bandwidth is proportional to the bit rate; if we need to 
send bits faster, we need more bandwidth. 

2) Broadband Transmi: 
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Broadband transmission or modulation means changing the digital signal to an analog signal for 
transmission. Modulation allows us to use a bandpass channel-a channel with a bandwidth that 


does not start from zero. This type of channel is more available than a low-pass channel. 


ae > 


f, Bandpass channel f, Frequency 


Amplitude 





If the available channel is a bandpass channel we cannot send the digital signal directly to the 


channel; we need to convert the digital signal to an analog signal, before transmission. 


2.4 Transmission Impairment. 
Signals travel through transmission media, which are/not perfect. The imperfection causes signal 
impairment. This means that the signal at the beginning of the medium is not the same as the 


signal at the end of the medium, 


Three causes of impairment are attenuation) distortion, and)noise. 


Impairment 
causes 








oe 


Attenuation 


ae 


¢ Attenuation means a loss of energy. 
e When a signal, simple or composite, travels through a medium, it loses some of its energy in 
overcoming the resistance of the medium. 


e To compensate for this loss, amplifiers are used to amplify the signal. 
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Original Attenuated Amplified 


na AN 






Point 1 Transmission medium Point 2 Point 3 


Decibel: 
The decibel (dB) measures the relative strengths of two signals)or one signal at two different 


points. The decibel is negative if a signal is attenuated andypositive if a signal is amplified. 


P, 
dB =10 loge P. 
1 


Variables PI and P2 are the powers of a Signal at points 1 and 2,,respectively. 








Example: 
1) Suppose a signal travels through a transmission medium and,its power is reduced to one-half. 


This means that P2 is (1/2)P1. In this case, the attenuation (loss of power) can be calculated as 





5P 


Il 
| 
vs) 
a 
ve) 





P 0.5 
10 logy me 10 logio 1 = 10 logy 0.5 = 10(-0.3) 
1 1 








A loss of 3)dB (—3 dB) is equivalentto losing one-half the power. 
2) A signal travels through an amplifier, and its power is increased 10 times. This means that 


P2=10P1 . Inthis casé)the amplification (gain of power) can be calculated as 





P, 
10 logio Pp. 


1 














= 10 log,9 10 = 1001) = 10 dB 
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3) One reason that engineers use the decibel to measure the changes in the strength of a signal is 
that decibel numbers can be added (or subtracted) when we are measuring several points 


(cascading) instead of just two. In Figure a signal travels from point 1 to point 4. 
1dB 








Y 
ry 
¥ 


lL. -3. dB + 7 dB -3dB 
1 
1 
i 











Point 1 Transmission Point 2 Point 3 Transmission —_ Point 4 
medium medium 


In this case, the decibel value can be calculated as 


dB =-3+7-3=+1 














4) Sometimes the decibel is used to measure signal power in milliwatts. In this case, it is referred 
to as dBm and is calculated as dBm =10elog10, Pm , where Pm is the power in milliwatts. 
Calculate the power of a signal with dBm = —30. 


Solution: 





dB = 10 logig Py, = —30 
logio Pm = —3 Pin= 10-7 mw 











5) The loss in a cable is usually,defined,in decibels per kilometer (dB/km). If the signal at the 
beginning ofja cable with —0.3 dB/km has a power of 2 mW, what is the power of the signal at 5 
km? 

Solution: 


The loss in the cable in decibels is 5 x (-0.3) =—1.5 dB. We can calculate the power as 





Ps 
dB = 10 logjo Pp. 


i 
P. 
= 21" S07i 


Pe 
P>=0.71P, =0.7%2=1.4mW 
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Distortion 

e Distortion means that the signal changes its form or shape. 

e Distortion can occur in a composite signal made of different frequencies. 

¢ Each signal component has its own propagation speed (see the next section) through a 
medium and, therefore, its own delay in arriving at the final destination. 

¢ Differences in delay may create a difference in phase if the delay is not exactly the same as 
the period duration. 

e In other words, signal components at the receiver have phases different from what they had at 


the sender. The shape of the composite signal is therefore not the same. 


























Composite signal Composite signal 
sent received 
Components, Components, 
in phase out of phase 
At the sender At the receiver 


Noise 

© Noise isanother cause of impairment. 

e Several types of noise, such as thermal noise, induced noise, crosstalk, and impulse noise, 
may corrupt the signal. 

e Thermal noise is the random motion of electrons in a wire which creates an extra signal not 
originally sent by the transmitter. 

e Induced noise comes from sources such as motors and appliances. These devices act as a 
sending antenna, and the transmission medium acts as the receiving antenna. 

e Crosstalk is the effect of one wire on the other. One wire acts as a sending antenna and the 


other as the receiving antenna. 
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e Impulse noise is a spike (a signal with high energy in a very short time) that comes from 


power lines, lightning, and so on. 


Transmitted Noise Received , 
i i 
! j ! 


Point 1 Transmission medium pointes 


Signal-to-Noise Ratio (SNR) 


SNR “average signal power 
average noise power 
SNR is actually the ratio of what is wanted (signal) to,what is not wanted (noise). 
A high SNR means the signal is less Corrupted by noisé;,a low SNR)means the signal is more 
corrupted by noise. 
Because SNR is the ratio of two powers, itis often described in decibel units, SNRgp, defined as 
SNRdb =10logi9 SNR 


2.5 Data Rate Limits 

Data rate depends)on three factors: 

1. The bandwidth available 

2. The level of the'signals we use 

3. The quality of the channel (the level of noise) 


Noiseless Channel: Nyquist Bit Rate 





For a noiseless channel, the Nyquist bit rate formula defines the theoretical maximum bit rate 
BitRate = 2 x bandwidth x 10log2 L 

In this formula, bandwidth is the bandwidth of the channel, L is the number of signal levels used 

to represent data, and BitRate is the bit rate in bits per second. 


Increasing the levels of a signal may reduce the reliability of the system. 
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Example: 
1) Consider a noiseless channel with a bandwidth of 3000 Hz transmitting a signal with two 


signal levels. The maximum bit rate can be calculated as 


BitRate = 2 x 3000 x logs 2 = 6000 bps 














2) Consider the same noiseless channel transmitting a signal with four signal levels (for each 


level, we send 2 bits). The maximum bit rate can be calculated as 


BitRate = 2 x 3000 x logs 4 = 12,000 bps 














3) We need to send 265 kbps over a noiseless channel with a bandwidth of 20 kHz. How many 
signal levels do we need? 


Solution: 





265,000 = 2 x 20,000 x logs L 
logs L= 6.625 L=2°° = 98.7 levels 


Since this result is not a power of 2)we need to either increase the number of levels or reduce the 











bit rate. If we have 128 levels, the bit rate is 280 kbps. If we have 64 levels, the bit rate is 240 
kbps. 





Noisy Channel: Shannon Capacity 





In reality, we cannot have a noiseless channel; the channel is always noisy. In 1944, Claude 
Shannon introduced a formula, called the Shannon capacity, to determine the theoretical highest 
data rate for a noisy channel: 

Capacity =bandwidth X log2 (1 +SNR) 
In this formula, bandwidth is the bandwidth of the channel, SNR is the signal-to noise ratio, and 


capacity is the capacity of the channel in bits per second. 
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Example: 
1) Consider an extremely noisy channel in which the value of the signal-to-noise ratio is almost 
zero. In other words, the noise is so strong that the signal is faint. For this channel the capacity C 


is calculated as 


C =B loga (1 + SNR) = B logy (1 + 0) = B logy 1= Bx 0=0 














This means that the capacity of this channel is zero regardless of the bandwidth. In other words, 
we cannot receive any data through this channel. 
2) We can calculate the theoretical highest bit rate of a regular telephone line. A telephone line 


normally has a bandwidth of 3000. The signal-to-noise ratiojis usually 3162; For this channel the 


capacity is calculated as 


C= B logs (1 + SNR) = 3000 logs (1 + 3162) = 3000 log, 3163 





= 3000 x 11.62 = 34,860 bps 


This means that the highest bit rate for atelephone line is.34.860 kbps. If we want to send data 





faster than this, we can either increase, the bandwidthof the line or improve the signal-to-noise 
ratio. 

3) The signal-to-noise ratio is often given in decibels. Assume that SNRdB = 36 and the channel 
bandwidth is 2 MHz. The theoretical channehcapacity can be calculated as 


SNRyp = 10 logigSNR = SNR = 1O°NRu/10 ee SNR = 103° = 3081 





C= B logs (1+ SNR) =2 X 10°X logs 3982 = 24 Mbps 





4) For practical purposes)when the SNR is very high, we can assume that SNR + 1 is almost the 

same as SNR. In these cases, the theoretical channel capacity can be simplified to 

SNRup 
3 





(C= Ie 25 











For example, we can calculate the theoretical capacity of the previous example as 


C=2 MHz x = 24 Mbps 
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2.6 Performance 


Bandwidth 
One characteristic that measures network performance is bandwidth. However, the term can be 
used in two different contexts with two different measuring values: bandwidth in hertz and 


bandwidth in bits per second. 


Bandwidth in Hertz 
Bandwidth in hertz is the range of frequencies contained in a composite signal or the range of 
frequencies a channel can pass. 


For example, we can say the bandwidth of a subscriber telephone line is 4 kHz) 


Bandwidth in Bits per Seconds 

The term bandwidth can also refer to the number of bits per second that a channel, a link, or even 
a network can transmit. 

For example, one can say the bandwidth of a Fast Ethernet network (or the links in this network) 


is a maximum of 100 Mbps»This means that this.networkcan send 100 Mbps. 


Relationship 
There is an. explicit relationship between the»bandwidth in hertz and bandwidth in bits per 
seconds. Basically, an increase,in bandwidthjin hertz means an increase in bandwidth in bits per 


second. 


Throughput 

The throughput is a measure’of how fast we can actually send data through a network. 

The bandwidth is a potential measurement of a link; the throughput is an actual measurement of 
how fast we can send data. 

For example, we may have a link with a bandwidth of 1 Mbps, but the devices connected to the 
end of the link may handle only 200 kbps. This means that we cannot send more than 200 kbps 
through this link. 


L 
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The latency or delay defines how long it takes for an entire message to completely arrive at the 
destination from the time the first bit is sent out from the source. 
Latency is made of four components: propagation time, transmission time, queuing time and 
processing delay. 

Latency =propagation time +transmission time +queuing time + processing delay 
Propagation Time 
Propagation time measures the time required for a bit to travel from the source to the destination. 
The propagation time is calculated by dividing the distance by the propagation speed. 
Propagation time = Distance / Propagation speed 
Transmission time 
The time required for transmission of a message depends on the size of the message and the 
bandwidth of the channel. 
Transmission time = Message size/ Bandwidth 
Queuing Time 
The third component in latency is'the queuing time, the time needed for each intermediate or end 
device to hold the message before it can be processed)The queuing time is not a fixed factor; it 
changes with the load imposed on the network. When there is heavy traffic on the network, the 
queuing time increases. An intermediateydevice, such as a router, queues the arrived messages 


and processes them one by one. If there,are many messages, each message will have to wait. 


Bandwidth Delay Product 


The bandwidth~delay product defines, the number of bits that can rdl the link. 
Length: delay 


Cross section: bandwidth —-{@)) sss Volumes bandwidth 2 clelaysss) 


Jitter 











Jitter is a variation in packet arrival time. 
For ex, If the delay for the first packet is 20 ms, for the second is 45 ms, and for the third is 40 


ms, then the real-time application that uses the packets endures jitter. 


2.7 Digital to Digital Conversion 
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Line coding is the process of converting digital data to digital signals. 


Line coding converts a sequence of bits to a digital signal. At the sender, digital data are encoded 


into a digital signal; at the receiver, the digital data are recreated by decoding the digital signal. 


Receiver 




































Digital data Digital data 
Digital signal 
0101 e*¢101 0101 ¢¢+101 
Line Coding Schemes 


NRZ 


NRZ, RZ, and biphase (Manchester, 
and differential Manchester) 


AMI and pseudoternary 


2B/1Q, 8B/6T, and 4D-PAMS 


MLT-3 





Unipolar Scheme 

In a unipolar scheme, all the signal levels are on one side of the time axis, either above or below. 
Traditionally, a unipolar scheme was designed as a non-return-to-zero (NRZ) scheme in which 
the positive voltage defines bit 1 and the zero voltage defines bit 0. It is called NRZ because the 


signal does not return to zero at the middle of the bit. 


Amplitude 





a Lave gor av | 


Time Normalized power 





Polar Schemes 
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e In polar schemes, the voltages are on the both sides of the time axis. For example, the voltage 
level for 0 can be positive and the voltage level for I can be negative. 

e Non-Return-to-Zero (NRZ): In polar NRZ encoding, we use two levels of voltage 
amplitude. We can have two versions of polar NRZ: NRZ-Land NRZ-I 

e In NRZ-L the level of the voltage determines the value of the bit. 


¢ In NRZ-I the inversion or the lack of inversion determines the value of the bit. 


' 
' 
' 


NRZ-L 1 | — | > 
Time 


1 1 Bandwidth 
= i +t —— . 
NRZ-1 + > 
' t 1 ' Time 
1 ' 














© Noinversion: Next bit isO — @ Inversion: Next bit is 1 


Return to Zero (RZ): 

¢ The main problem with NRZ encoding occurs when the sender and receiver clocks are not 
synchronized. The receiver does not know when one bithas ended and the next bit is starting. 

© One solution is the return-to-zero (RZ) scheme, which uses three values: positive, negative, 
and zero. 

e In RZ, the)signal changes not between bits butduring the bit. In the below Figure we see that 
the,signal goes)to 0 in the middle of each bit. It remains there until the beginning of the next 
bit. 

¢ The main disadvantage)of RZ encoding is that it requires two signal changes to encode a bit 


and therefore occupies)greater bandwidth. 
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Manchester and Differential Manchester: 
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e The idea of RZ (transition at the middle of the bit) and the idea of NRZ-L are combined into 
the Manchester scheme. 

e In Manchester encoding, the duration of the bit is divided into two halves. The voltage 
remains at one level during the first half and moves to the other level in the second half. The 
transition at the middle of the bit provides synchronization. 

e Differential Manchester on the other hand, combines the ideas of RZ and NRZ-I. There is 
always a transition at the middle of the bit, but the bit values are determined at the beginning 


of the bit. If the next bit is 0, there is a transition; if the nextbit is 1, there is none. 
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© No inversion: Next bitis 1 @ Inversion: Next bit is 0 

















Bipolar Schemes 

In bipolar encoding,(sometimes ealled mu/tilevel binary), there are three voltage levels: positive, 
negative, and zero. Thé)voltagelevel for one,data element is at zero, while the voltage level for 
the other element alternates betweempositiye and negative. 

AMI and Pseudoternary>A common bipolar encoding scheme is called bipolar alternate mark 
inversion (AMI). In the term alternate mark inversion, the word mark comes from telegraphy and 
means 1. SoAMPmeans alternate I inversion. A neutral zero voltage represents binary O. Binary 
Is are represented by alternating positive and negative voltages. A variation of AMI encoding is 
called pseudoternary in which the | bit is encoded as a zero voltage and the 0 bit is encoded as 


alternating positive and negative voltages. 


Amplitude 





AMI 


Bandwidth 


Pseudoternary 
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Module — 2 
PHYSICAL LAYER - 2 


2.1 Analog-To-Digital Conversion 
Pulse Code Modulation (PCM) 


The most common technique to change an analog signal to digital data (digitization) is called 
pulse code modulation (PCM). A PCM encoder has three processes. 

1. The analog signal is sampled. 

2. The sampled signal is quantized. 


3. The quantized values are encoded as streams of bits. 


Quantized signal 


[mom] 














PCM encoder 
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Digital data 
Analog signal 
PAM signal 
Sampling: 


e The first step in PCM is sampling. The analog signal is sampled every 7s s, where Ts is the 
sample interval or period. 

e The inverse of the sampling interval is called the sampling rate or sampling frequency and 
denoted by fs, where fs = //Ts. 


e There are three sampling methods-ideal, natural, and flat-top. 
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a. Ideal sampling 'b. Natural sampling 
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© Flattop sampling 

e In ideal sampling, pulses from the analog signal are sampled. 

e In natural sampling, a high-speed switch is turned on for only the small period of time when 
the sampling occurs. The result is a sequence of samples that retains the shape of the analog 
signal. 

e The most common sampling method, called sample and hold, however, creates flat-top 
samples by using a circuit. 

e The sampling process is sometimes referred to as pulse amplitude modulation (PAM). 

e Sampling Rate: According to the Nyquist theorem, to reproduce the original analog signal, 
one necessary condition is that the sampling rate be at least twice the highest frequency in the 


original signal. 


Quantization 

e The result of sampling is a series of pulses with amplitude values between the maximum and 
minimum amplitudes of the signal. 

e The set of amplitudes can be infinite with nonintegral values between the two limits. These 
values cannot be used in the encoding process. 

The following are the steps in quantization: 

1. We assume that the original analog signal has instantaneous amplitudes between Vmin and 

Vmax. 

2. We divide the range into L zones, each of height A (delta). 

A= (Vmax-Vmin)/L 


3. We assign quantized values of 0 to L - 1 to the midpoint of each zone. 
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4, We approximate the value of the sample amplitude to the quantized values. 


As a simple example, assume that we have a sampled signal and the sample amplitudes are 


between -20 and +20 V. We decide to have eight levels (L = 8). This means that A =5 V. 


Quantization Normalized 
codes amplitude 





oO 
Normalized -1.22 1.50 3.24 3.94 2.20 -1.10 -2.26 -1.88 -1.20 
PAM values 
Normalized -1.50 1.50 3.50 3.50 2.50 -1.50 -2.50 -1.50 -1.50 
quantized values 
Normalized -0.38 oO +0.26 -0.44 +0.30 -0.40 -0.24 +0.38 -0.30 
error 
Quantization code 2 5 z 7 6 2 1 2 2 
Encoded words 010 101 Ww mW 110 010 001 010 010 


Quantization Levels: In the previous example, we showed eight quantization levels. The choice 
of L, the number of levels, depends on the range of the amplitudes of the analog signal and how 
accurately we need to recover the signal. If the amplitude of a signal fluctuates between two 
values only, we need only two levels; if the signal, like voice, has many amplitude values, we 
need more quantization levels. In audio digitizing, L is normally chosen to be 256; in video it is 
normally thousands. Choosing lower values of L increases the quantization error if there is a lot 


of fluctuation in the signal. 


Quantization Error: One important issue is the error created in the quantization process. 
Quantization is an approximation process. The input values to the quantizer are the real values; 


the output values are the approximated values. The output values are chosen to be the middle 
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value in the zone. If the input value is also at the middle of the zone, there is no quantization 
error; otherwise, there is an error. 

The quantization error changes the signal-to-noise ratio of the signal, which in turn reduces the 
upper limit capacity according to Shannon. 

It can be proven that the contribution of the quantization error to the SNRap of the signal depends 
on the number of quantization levels L, or the bits per sample nb' as shown in the following 


formula: 
SNRgp =6.02nb + 1.76 dB 


Encoding: 


The last step in PCM is encoding. After each sample is quantized and the number of bits per 
sample is decided, each sample can be changed to an np-bit code word. 

A quantization code of 2 is encoded as 010; 5 is encoded as 101; and so on. Note that the number 
of bits for each sample is determined from the number of quantization levels. If the number of 
quantization levels is L, the number of bits is n, =log2 L. In our example L is 8 and np is 
therefore 3. The bit rate can be found from the formula 


Bit rate = sampling rate x number of bits per sample = fs x Np 





Amplitude 











PCM decoder 
11st+1100 
Digital data 


Delta Modulation 
PCM finds the value of the signal amplitude for each sample; DM finds the change from the 
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Analog signal 
































previous sample. 
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Modulator 


e The modulator is used at the sender site to create a stream of bits from an analog signal. 


e The process records the small positive or negative changes, called delta 6. If the delta is 
positive, the process records a 1; if it is negative, the process records a 0. 

e However, the process needs a base against which the analog signal is compared. The 
modulator builds a second signal that resembles a staircase. 

¢ Finding the change is then reduced to comparing the input signal with the gradually made 
staircase signal. 

e The modulator, at each sampling interval, compares the value of the analog signal with the 


last value of the staircase signal. If the amplitude of the analog signal is larger, the next bit in 


the digital data is 1; otherwise, it is 0. 


ee! 


Analog signal 


DM modulator 




















Tee" 1 TO0 
Digital data 























Demodulator 
The demodulator takes the digital data and, using the staircase maker and the delay unit, creates 
the analog signal. The created analog signal, however, needs to pass through a low-pass filter for 


smoothing. 
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2.9 Transmission Modes 


e The transmission of binary data across a link can be accomplished in either parallel or serial 
mode. 

e In parallel mode, multiple bits are sent with each clock tick. 

e  Inserial mode, | bit is sent with each clock tick. While there is only one way to send parallel 
data, there are three subclasses of serial transmission: asynchronous, synchronous, and 


isochronous. 











Data transmission 


Serial 











[ero | Cameo) [Lea | 


Binary data, consisting of 1s and 0s, may be organized into groups of n bits each. Computers 


Parallel Transmission 


produce and consume data in groups of bits much as we conceive of and use spoken language in 
the form of words rather than letters. By grouping, we can send data n bits at a time instead of 1. 


This is called parallel transmission. 
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The mechanism for parallel transmission is a conceptually simple one: Use n wires to send n bits 
at one time. That way each bit has its own wire, and all n bits of one group can be transmitted 
with each clock tick from one device to another. Figure shows how parallel transmission works 


for n =8. Typically, the eight wires are bundled in a cable with a connector at each end. 











Receiver 

















The advantage of parallel transmission is speed. All else being equal, parallel transmission can 
increase the transfer speed by a factor of n over serial transmission. 

But there is a significant disadvantage: cost. Parallel transmission requires n communication 
lines (wires in the example) just to transmit the data stream. Because this is expensive, parallel 
transmission is usually limited to short distances. 

Serial Transmission 

In serial transmission one bit follows another, so we need only one communication channel 
rather than n to transmit data between two communicating devices. 

The advantage of serial over parallel transmission is that with only one communication channel, 
serial transmission reduces the cost of transmission over parallel by roughly a factor of n. 

Since communication within devices is parallel, conversion devices are required at the interface 
between the sender and the line (parallel-to-serial) and between the line and the receiver (serial- 
to-parallel). 


Serial transmission occurs in one of three ways: asynchronous, synchronous, and isochronous. 
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Asynchronous Transmission 

e Asynchronous transmission is so named because the timing of a signal is unimportant. 
Instead, information is received and translated by agreed upon patterns. 

e As long as those patterns are followed, the receiving device can retrieve the information 
without regard to the rhythm in which it is sent. 

e Patterns are based on grouping the bit stream into bytes. Each group, usually 8 bits, is sent 
along the link as a unit. The sending system handles each group independently, relaying it to 
the link whenever ready, without regard to a timer. 

© Without synchronization, the receiver cannot use timing to predict when the next group will 
arrive. 

e To alert the receiver to the arrival of a new group, therefore, an extra bit is added to the 
beginning of each byte. This bit, usually a 0, is called the start bit. 

e To let the receiver know that the byte is finished, 1 or more additional bits are appended to 
the end of the byte. These bits, usually 1 s, are called stop bits. 

e By this method, each byte is increased in size to at least 10 bits, of which 8 bits is 
information and 2 bits or more are signals to the receiver. 

e In addition, the transmission of each byte may then be followed by a gap of varying duration. 
This gap can be represented either by an idle channel or by a stream of additional stop bits. 

e This mechanism is called asynchronous because, at the byte level, the sender and receiver do 
not have to be synchronized. But within each byte, the receiver must still be synchronized 


with the incoming bit stream. 
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e The addition of stop and start bits and the insertion of gaps into the bit stream make 
asynchronous transmission slower than forms of transmission that can operate without the 


addition of control information. But it is cheap and effective. 


Direction of flow 
——z Stop bit Start bit ———— 


Data 
11111011 


Sender Receiver 


01101 11111011 00010111 1 
Bb ie BH e 


‘Gaps between 
data units 











Synchronous Transmission 

e In synchronous transmission, the bit stream is combined into longer "frames," which may 
contain multiple bytes. 

e Each byte, however, is introduced onto the transmission link without a gap between it and the 


next one. It is left to the receiver to separate the bit stream into bytes for decoding purposes. 


Direction of flow 


Frame 


11111011] 11110110 11110111 





e The advantage of synchronous transmission is speed. With no extra bits or gaps to introduce 
at the sending end and remove at the receiving end, and, by extension, with fewer bits to 
move across the link, synchronous transmission is faster than asynchronous transmission. 

e Although there is no gap between characters in synchronous serial transmission, there may be 
uneven gaps between frames. 

Isochronous 

In real-time audio and video, in which uneven delays between frames are not acceptable, 

synchronous transmission fails. For example, TV images are broadcast at the rate of 30 images 

per second; they must be viewed at the same rate. If each image is sent by using one or more 


frames, there should be no delays between frames. For this type of application, synchronization 
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between characters is not enough; the entire stream of bits must be synchronized. The 


isochronous transmission guarantees that the data arrive at a fixed rate. 


2.2 Digital-To-Analog Conversion 


Digital-to-analog conversion is the process of changing one of the characteristics of an analog 


signal based on the information in digital data. 


Sender Receiver 















Analog signal 
































Digital data Digital data 
0101***101 6101 «9+ TOT 
a a. 

Modubetes Link 


e Sine wave is defined by three characteristics: amplitude, frequency, and phase. 

e When we vary anyone of these characteristics, we create a different version of that wave. So, 
by changing one characteristic of a simple electric signal, we can use it to represent digital 
data. 

e Any of the three characteristics can be altered in this way, giving us at least three 
mechanisms for modulating digital data into an analog signal: amplitude shift keying (ASK), 
frequency shift keying (FSK), and phase shift keying (PSK). 

e In addition, there is a fourth (and better) mechanism that combines changing both the 
amplitude and phase, called quadrature amplitude modulation (QAM). QAM is the most 


efficient of these options and is the mechanism commonly used today. 





Digital-to-analog 
conversion 
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Aspects of Digital-to-Analog Conversion 
e Data Element versus Signal Element: Data element is the smallest piece of information to 
be exchanged, that is the bit. Signal element is the smallest unit of a signal that is constant. 
e Data Rate Versus Signal Rate: The relationship between them is 
S=Nx I/r baud 
Where, N is the data rate (bps), r is the number of data elements carried in one signal 


element. r = logsL, where L is the type of signal element. 


Amplitude Shift Keying 
In amplitude shift keying, the amplitude of the carrier signal is varied to create signal elements. 


Both frequency and phase remain constant while the amplitude changes. 


Binary ASK (BASK) 
Although we can have several levels (kinds) of signal elements, each with a different amplitude, 
ASK is normally implemented using only two levels. This is referred to as binary amplitude shift 
keying or on-off keying (OOK). The peak amplitude of one signal level is 0; the other is the 
same as the amplitude of the carrier frequency. 

Amplitude Bit rate: 5 

1 






















1 Time Bandwidth 
Tsignal | 1signal | 1signal ; 1signal | 1signal 1 
element | element | element | element | element | 
Is = 
Baud rate: 5 
Bandwidth of ASK is 
B=(1 +d)xS 


Where S is the signal rate and the B is the bandwidth. The d depends on the modulation and 


filtering process. The value of d is between 0 and 1. 


Multilevel ASK 
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We can have multilevel ASK in which there are more than two levels. We can use 4,8, 16, or 
more different amplitudes for the signal and modulate the data using 2, 3, 4, or more bits at a 


time. 


Frequency Shift Keying 

In frequency shift keying, the frequency of the carrier signal is varied to represent data. The 
frequency of the modulated signal is constant for the duration of one signal element, but changes 
for the next signal element if the data element changes. Both peak amplitude and phase remain 


constant for all signal elements. 


Binary FSK (BFSK) 
One way to think about binary FSK (or BFSK) is to consider two carrier frequencies. In Figure, 
we have selected two carrier frequencies,f} and12. We use the first carrier if the data element is 
0; we use the second if the data element is 1. However, note that this is an unrealistic example 
used only for demonstration purposes. Normally the carrier frequencies are very high, and the 


difference between them is very small. 


Amplitude 


Bit rate: 5 





B=S(1 +d) +2Df 
S(1+d) S(1+ d) 
{ Time 
Isignal 1 Tsignal | signal 1 1signal 1 1signal | 


element | element | element | element | element | 





1s 
Baud rate: 5 








Bandwidth for BFSK: 
We can think of FSK as two ASK signals, each with its own carrier frequency (f7 or f2) If the 
difference between the two frequencies is 2Af then the required bandwidth is 

B=(I+d) x SH2Af 
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Phase Shift Keying 
In phase shift keying, the phase of the carrier is varied to represent two or more different signal 


elements. Both peak amplitude and frequency remain constant as the phase changes. 


Binary PSK (BPSK) 

The simplest PSK is binary PSK, in which we have only two signal elements, one with a phase 
of 0°, and the other with a phase of 180°. Below Figure gives a conceptual view of PSK. Binary 
PSK is as simple as binary ASK with one big advantage-it is less susceptible to noise. In ASK, 
the criterion for bit detection is the amplitude of the signal; in PSK, it is the phase. Noise can 
change the amplitude easier than it can change the phase. In other words, PSK is less susceptible 


to noise than ASK. PSK is superior to FSK because we do not need two carrier signals. 


Amplitude Bit rate: 5 








h Bandwidth 
1 Time k— | 


. hektid 1 
Tsignal 1 1signal 1 
element | element | 





Tsignal | 1 signal | 1 signal 


element | element | element 


1s 
Baud rate: 5 








Bandwidth: 
B=(1 +d)xS 
Where S is the signal rate and the B is the bandwidth. The d depends on the modulation and 


filtering process. The value of d is between 0 and 1. 


Quadrature PSK (QPSK): 

The simplicity of BPSK enticed designers to use 2 bits at a time in each signal element, thereby 
decreasing the baud rate and eventually the required bandwidth. The scheme is called quadrature 
PSK or QPSK because it uses two separate BPSK modulations; one is in-phase, the other 
quadrature (out-of-phase). The incoming bits are first passed through a serial-to-parallel 


conversion that sends one bit to one modulator and the next bit to the other modulator. If the 
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duration of each bit in the incoming signal is T, the duration of each bit sent to the corresponding 


BPSK signal is 2T. This means that the bit to each BPSK signal has one-half the frequency of the 


original signal. 
4 00 10 o1 WW 
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Quadrature Amplitude Modulation 
Quadrature amplitude modulation is a combination of ASK and PSK. The idea of using two 
carriers, one in-phase and the other quadrature, with different amplitude levels for each carrier is 


the concept behind quadrature amplitude modulation (QAM). 


BANDWIDTH UTILIZATION 


Bandwidth utilization is the wise use of available bandwidth to achieve specific goals. Efficiency 


can be achieved by multiplexing; privacy and anti jamming can be achieved by spreading. 


3.1 Multiplexing 
© Multiplexing is the set of techniques that allows the simultaneous transmission of multiple 
signals across a single data link. 


e Ina multiplexed system, n lines share the bandwidth of one link. 
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e Below figure shows the basic format of a multiplexed system. 


MUX: Multiplexer j. 
DEMUX: Demultiplexer 


: n Output 
. lines 





n Input A 
lines * 


xcEmo 


1 link, n channels 





e The lines on the left direct their transmission streams to a multiplexer (MUX), which 
combines them into a single stream (many-to one). 

e At the receiving end, that stream is fed into a demultiplexer (DEMUX), which separates the 
stream back into its component transmissions (one-to-many) and directs them to their 
corresponding lines. 

e In the figure, the word link refers to the physical path. The word channel refers to the portion 
of a link that carries a transmission between a given pair of lines. One link can have many (n) 
channels. 

e There are three basic multiplexing techniques: frequency-division multiplexing, wavelength- 


division multiplexing, and time-division multiplexing. 


Multiplexing 


Analog Analog Digital 





Frequency-Division Multiplexing 

e Frequency-division multiplexing (FDM) is an analog technique that can be applied when the 
bandwidth of a link (in hertz) is greater than the combined bandwidths of the signals to be 
transmitted. 


e In FDM, signals generated by each sending device modulate different carrier frequencies. 
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e These modulated signals are then combined into a single composite signal that can be 
transported by the link. 

e Carrier frequencies are separated by sufficient bandwidth to accommodate the modulated 
signal. These bandwidth ranges are the channels through which the various signals travel. 
Channels can be separated by strips of unused bandwidth-guard bands-to prevent signals 
from overlapping. 


e In addition, carrier frequencies must not interfere with the original data frequencies. 


Input 
lines 


Output 
lines 


mes 
xcE=mo 


Multiplexing Process 

Each source generates a signal of a similar frequency range. Inside the multiplexer, these similar 
signals modulates different carrier frequencies (f7,/2 and f3). The resulting modulated signals are 
then combined into a single composite signal that is sent out over a media link that has enough 


bandwidth to accommodate it. 
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Demultiplexing Process 

The demultiplexer uses a series of filters to decompose the multiplexed signal into its constituent 
component signals. The individual signals are then passed to a demodulator that separates them 
from their carriers and passes them to the output lines. 
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Examples: 

1) Assume that a voice channel occupies a bandwidth of 4 kHz. We need to combine three 
voice channels into a link with a bandwidth of 12 kHz, from 20 to 32 kHz. Show the 
configuration, using the frequency domain. Assume there are no guard bands. 

Solution 

We shift (modulate) each of the three voice channels to a different bandwidth, as shown in 
Figure 


Shift and combine 





































== m 
~<a — | — 
[a= |, ] 








Fiterand shite 
We use the 20- to 24-kHz bandwidth for the first channel, the 24- to 28-kHz bandwidth for the 
second channel, and the 28- to 32-kHz bandwidth for the third one. Then we combine them. At 
the receiver, each channel receives the entire signal, using a filter to separate out its own signal. 
The first channel uses a filter that passes frequencies between 20 and 24 kHz and filters out 
(discards) any other frequencies. The second channel uses a filter that passes frequencies 
between 24 and 28 kHz, and the third channel uses a filter that passes frequencies between 28 
and 32 kHz. Each channel then shifts the frequency to start from zero. 


2) Five channels, each with a 100-kHz bandwidth, are to be multiplexed together. What is 
the minimum bandwidth of the link if there is a need for a guard band of 10kHz between 
the channels to prevent interference? 

Solution: 
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For five channels, we need at least four guard bands. This means that the required bandwidth is 
at least 5 x 100 + 4 x 10 =540 kHz, as shown in Figure 
Guard band 
of 10 kHz 


_ ook || 100K jour F esl |. oom | 
















540 kHz | 


3) Four data channels (digital), each transmitting at I Mbps, use a satellite channel of I 
MHz. Design an appropriate configuration, using FDM. 

Solution 

The satellite channel is analog. We divide it into four channels, each channel having a 250-kHz 
bandwidth. Each digital channel of | Mbps is modulated such that each 4 bits is modulated to 1 
Hz. One solution is 16-QAM modulation. Figure shows one possible configuration. 


Digital 


1 Mbps 
Digital 





1 Mbps 
Digital 





1 Mbps 
Digital 





Application: 
The Analog Carrier System 
e To maximize the efficiency of infrastructure, telephone companies have traditionally 


multiplexed signals from lower-bandwidth lines onto higher-bandwidth lines. In this way, 
many switched or leased lines can be combined into fewer but bigger channels. For analog 
lines, FDM is used. 

e One of these hierarchical systems used by AT&T is made up of groups, supergroups, master 


groups, and jumbo groups. 





Abhijith H V, Assistant Professor, Dept. of ISE, SVIT, Bangalore Page 18 





Data Communication Module 2: Physical Layer-2 





e In this analog hierarchy, 12 voice channels are multiplexed onto a higher-bandwidth line to 
create a group. A group has 48 kHz of bandwidth and supports 12 voice channels. 

e At the next level, up to five groups can be multiplexed to create a composite signal called a 
supergroup. A supergroup has a bandwidth of 240 kHz and supports up to 60 voice channels. 
Supergroups can be made up of either five groups or 60 independent voice channels. 

e At the next level, 10 supergroups are multiplexed to create a master group. A master group 
must have 2.40 MHz of bandwidth, but the need for guard bands between the supergroups 
increases the necessary bandwidth to 2.52 MHz. Master groups support up to 600 voice 
channels. 

e Finally, six master groups can be combined into a jumbo group. A jumbo group must have 


15.12 MHz (6 x 2.52 MHz) but is augmented to 16.984 MHz to allow for guard bands 


between the master groups. 





12 voice channels 





F | Supergroup 





Master group 


10 supergroups 


6 master groups 





Other Applications of FDM 

e AM and FM radio broadcasting. 
Radio uses the air as the transmission medium. A special band from 530 to 1700 kHz is 
assigned to AM radio. All radio stations need to share this band. Each AM station needs 10 
kHz of bandwidth. Each station uses a different carrier frequency, which means it is shifting 
its signal and multiplexing. The signal that goes to the air is a combination of signals. A 
receiver receives all these signals, but filters (by tuning) only the one which is desired. 


Without multiplexing, only one AM station could broadcast to the common link, the air. 





Abhijith H V, Assistant Professor, Dept. of ISE, SVIT, Bangalore Page 19 


Data Communication Module 2: Physical Layer-2 





However, we need to know that there is physical multiplexer or demultiplexer here. 
Multiplexing is done at the data link layer. 
The situation is similar in FM broadcasting. However, FM has a wider band of 88 to 108 
MHz because each station needs a bandwidth of 200 kHz. 

e Television broadcasting. 
Each TV channel has its own bandwidth of 6 MHz. 

e The first generation of cellular telephones. 
Each user is assigned two 30-kHz channels, one for sending voice and the other for 
receiving. The voice signal, which has a bandwidth of 3 kHz (from 300 to 3300 Hz), is 
modulated by using FM. 








Example 

The Advanced Mobile Phone System (AMPS) uses two bands. The first band of 824 to 849 
MHz is used for sending, and 869 to 894 MHz is used for receiving. Each user has a 
bandwidth of 30 kHz in each direction. The 3-kHz voice is modulated using FM, creating 
30 kHz of modulated signal. How many people can use their cellular phones 
simultaneously? 

Solution: 

Each band is 25 MHz. If we divide 25 MHz by 30 kHz, we get 833.33. In reality, the band is 
divided into 832 channels. Of these, 42 channels are used for control, which means only 790 


channels are available for cellular phone users. 





Wavelength-Division Multiplexing 

© Wavelength-division multiplexing (WDM) is designed to use the high-data-rate capability of 
fiber-optic cable. 

e The optical fiber data rate is higher than the data rate of metallic transmission cable. Using a 
fiber-optic cable for one single line wastes the available bandwidth. Multiplexing allows us 
to combine several lines into one. 

e WDM is conceptually the same as FDM, except that the multiplexing and demultiplexing 


involve optical signals transmitted through fiber-optic channels. 
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e Very narrow bands of light from different sources are combined to make a wider band of 


light. At the receiver, the signals are separated by the demultiplexer. 





dy JX. JX dy 
veer it ANN fy, 
: ra Ay + Ag + Ag fe . 


e The combining and splitting of light sources are easily handled by a prism. Prism bends a 
beam of light based on the angle of incidence and the frequency. Using this technique, a 
multiplexer can be made to combine several input beams of light, each containing a narrow 
band of frequencies, into one output beam of a wider band of frequencies. A demultiplexer 


can also be made to reverse the process. 


Fiber-optic cable 





Multiplexer Demultiplexer 


e One application of WDM is the SONET network in which multiple optical fiber lines are 
multiplexed and demultiplexed. 
e Anew method, called dense WDM (DWDM), can multiplex a very large number of channels 


by spacing channels very close to one another. It achieves even greater efficiency. 


Synchronous Time-Division Multiplexing 

e = Time-division multiplexing (TDM) is a digital process that allows several connections to 
share the high bandwidth of a link. 

e Instead of sharing a portion of the bandwidth as in FDM, time is shared. Each connection 
occupies a portion of time in the link. 

e TDM is a digital multiplexing technique for combining several low-rate channels into one 


high-rate one. 
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Data flow 























TDM is divided into two different schemes: synchronous and statistical 


Time Slots and Frames 

In synchronous TDM, the data flow of each input connection is divided into units, where each 
input occupies one input time slot. A unit can be | bit, one character, or one block of data. Each 
input unit becomes one output unit and occupies one output time slot. However, the duration of 
an output time slot is 7 times shorter than the duration of an input time slot. If an input time slot 


is 7's, the output time slot is Tn s, where n is the number of connections. 


be a _ >I [a 
C31 B3;A3 |[ C2; B2;A2|[C TiAl 


Frame 3 Frame 2 Frame 1 
























Each frame is 3 time slots. 
Each time slot duration is T/3 s. 













Data are taken from each 
line every Ts. 


In synchronous TDM, a round of data units from each input connection is collected into a frame. 
If we have n connections, a frame is divided into n time slots and one slot is allocated for each 
unit, one for each input line. If the duration of the input unit is T, the duration of each slot is Tin 
and the duration of each frame is T. 

In synchronous TDM, the data rate of the link is n times faster, and the unit duration is n times 


shorter. 
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Time slots are grouped into frames. A frame consists of one complete cycle of time slots, with 
one slot dedicated to each sending device. In a system with n input lines, each frame has n slots, 


with each slot allocated to carrying data from a specific input line. 


Example: 
Figure shows synchronous TOM with a data stream for each input and one data stream for the 
output. The unit of data is | bit. Find (a) the input bit duration, (b) the output bit duration, (c) the 


output bit rate, and (d) the output frame rate. 


1 Mbps 
Frames 





1 Mbps 























1 Mbps 


1 Mbps 





Solution 

a. The input bit duration is the inverse of the bit rate: 1/1 Mbps = 1 ps. 

b. — The output bit duration is one-fourth of the input bit duration, or 1/4. 

c. The output bit rate is the inverse of the output bit duration or 1/4 1s, or 4 Mbps. This can 
also be deduced from the fact that the output rate is 4 times as fast as any input rate; so 
the output rate =4 x 1 Mbps =4 Mbps. 

d. The frame rate is always the same as any input rate. So the frame rate is 1,000,000 frames 
per second. Because we are sending 4 bits in each frame, we can verify the result of the 


previous question by multiplying the frame rate by the number of bits per frame. 


Statistical Time-Division Multiplexing 





In statistical time-division multiplexing, slots are dynamically allocated to improve bandwidth 
efficiency. Only when an input line has a slot's worth of data to send is it given a slot in the 
output frame. In statistical multiplexing, the number of slots in each frame is less than the 


number of input lines. The multiplexer checks each input line in round robin fashion; it allocates 
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a slot for an input line if the line has data to send; otherwise, it skips the line and checks the next 


line. 





Line A 
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a. Synchronous TDM 
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b. Statistical TDM 

3.2 Spread Spectrum 

e Multiplexing combines signals from several sources to achieve bandwidth efficiency; the 
available bandwidth of a link is divided between the sources. 

e In spread spectrum, we also combine signals from different sources to fit into a larger 
bandwidth. 

e Spread spectrum is designed to be used in wireless applications (LANs and WANs). 

e In wireless applications, all stations use air (or a vacuum) as the medium for communication. 
Stations must be able to share this medium without interception by an eavesdropper and 
without being subject to jamming from a malicious intruder (in military operations, for 
example). 

e If the required bandwidth for each station is B, spread spectrum expands it to Bss such that 
Bss » B. The expanded bandwidth allows the source to wrap its message in a protective 


envelope for a more secure transmission. 
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Spreading 


process 





Spreading 
code 


Spread spectrum achieves its goals through two principles: 
1. The bandwidth allocated to each station needs to be, by far, larger than what is needed. This 


allows redundancy. 


Nv 


The expanding of the original bandwidth B to the bandwidth Bss must be done by a process 
that is independent of the original signal. In other words, the spreading process occurs after 


the signal is created by the source. 


After the signal is created by the source, the spreading process uses a spreading code and spreads 
the bandwidth. The figure shows the original bandwidth B and the spreaded bandwidth Bss. The 


spreading code is a series of numbers that look random, but are actually a pattern. 


There are two techniques to spread the bandwidth: 
1. Frequency hopping spread spectrum (FHSS) 
2. Direct sequence spread spectrum (DSSS) 


Frequency Hopping Spread Spectrum (FHSS 

e The frequency hopping spread spectrum (FHSS) technique uses M different carrier 
frequencies that are modulated by the source signal. 

e At one moment, the signal modulates one carrier frequency; at the next moment, the signal 
modulates another carrier frequency. Although the modulation is done using one carrier 
frequency at a time, M frequencies are used in the long run. 

e The bandwidth occupied by a source after spreading is BpHSS »B. 

The general layout for FHSS is shown below: 
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Original 
signal 


> Spread 
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Frequency 
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Frequency table 











Pseudorandom 
code generator 

















e A pseudorandom code generator, called pseudorandom noise (PN), creates a k-bit pattern for 
every hopping period 7h. 

e The frequency table uses the pattern to find the frequency to be used for this hopping period 
and passes it to the frequency synthesizer. 

e The frequency synthesizer creates a carrier signal of that frequency, and the source signal 
modulates the carrier signal. 

e Suppose we have decided to have eight hopping frequencies. This is extremely low for real 
applications and is just for illustration. In this case, M is 8 and k is 3. The pseudorandom 
code generator will create eight different 3-bit patterns. These are mapped to eight different 


frequencies in the frequency table. 


First-hop frequency 






k-bit patterns 





101 111 001 O00 010 110 011 100 








First selection 


Frequency table 
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The pattern for this station is 101, 111, 001, 000, 010, all, 100. Note that the pattern is 
pseudorandom it is repeated after eight hoppings. This means that at hopping period 1, the 
pattern is 101. The frequency selected is 700 kHz; the source signal modulates this carrier 
frequency. The second k-bit pattern selected is 111, which selects the 900-kHz carrier; the eighth 
pattern is 100, the frequency is 600 kHz. After eight hoppings, the pattern repeats, starting from 
101 again. Figure shows how the signal hops around from carrier to carrier. We assume the 


required bandwidth of the original signal is 100 kHz. 


Carrier 
frequencies 
(kHz) 


Cycle 1 Cycle 2 




































































> 
12 3 4 5 6 7 8 9 10 11 12 13 14 15 16 Hop 
periods 


Bandwidth Sharing 

If the number of hopping frequencies is M, we can multiplex M channels into one by using the 
same Bss bandwidth. This is possible because a station uses just one frequency in each hopping 
period; M - 1 other frequencies can be used by other M - 1 stations. In other words, M different 
stations can use the same Bss if an appropriate modulation technique such as multiple FSK 


(MFSK) is used. 
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a. FDM b. FHSS 








Above Figure shows an example of four channels using FDM and four channels using FHSS. In 
FDM, each station uses 11M of the bandwidth, but the allocation is fixed; in FHSS, each station 
uses 11M of the bandwidth, but the allocation changes hop to hop. 

Direct Sequence Spread Spectrum 

The direct sequence spread spectrum (DSSS) technique also expands the bandwidth of the 
original signal, but the process is different. 

In DSSS, we replace each data bit with 11 bits using a spreading code. In other words, each bit is 


assigned a code of 11 bits, called chips, where the chip rate is 11 times that of the data bit. 





Modulator 


Spread 
signal 


Original 
signal 





As an example, let us consider the sequence used in a wireless LAN, the famous Barker 
sequence where 11 is 11. We assume that the original signal and the chips in the chip generator 
use polar NRZ encoding. Below Figure shows the chips and the result of multiplying the original 
data by the chips to get the spread signal. In Figure, the spreading code is 11 chips having the 
pattern 10110111000 (in this case). If the original signal rate is N, the rate of the spread signal is 
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IN. This means that the required bandwidth for the spread signal is 11 times larger than the 
bandwidth of the original signal. The spread signal can provide privacy if the intruder does not 
know the code. It can also provide immunity against interference if each station uses a different 


code. 
Original 
signal 


Spreading 
code 


Spread 
signal 





Switching 
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A switched network consists of a series of interlinked nodes, called switches. 


Switches are devices capable of creating temporary connections between two or more devices 
linked to the switch. 


In a switched network, some of these nodes are connected to the end systems. Others are used 
only for routing. 
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Switched networks can be divided into three broad categories: circuit-switched networks, packet- 


switched networks, and message-switched. Packet-switched networks can further be divided into 


two subcategories-virtual-circuit networks and datagram networks. 


Switched 
networks 













irc switched Packet-switched 


networks 


networks 








Message-switched 
networks 








Datagram Virtual-circuit 
networks networks 


3.3 Circuit Switched Networks 


e A circuit-switched network consists of a set of switches connected by physical links. 


e A connection between two stations is a dedicated path made of one or more links. However, 


each connection uses only one dedicated channel on each link. 
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¢ Each link is normally divided into n channels by using FDM or TDM. 
e Below Figure shows a trivial circuit-switched network with four switches and four links. 


Each link is divided into n (n is 3 in the figure) channels by using FDM or TDM. 


| fF 








One link, n channels 











¢ Circuit switching takes place at the physical layer. 

e Before starting communication, the stations must make a reservation for the resources to be 
used during the communication. These resources, such as channels (bandwidth in FDM and 
time slots in TDM), switch buffers, switch processing time, and switch input/output ports, 
must remain dedicated during the entire duration of data transfer until the teardown phase. 

e Data transferred between the two stations are not packetized. The data are a continuous flow 
sent by the source station and received by the destination station, although there may be 
periods of silence. 

e There is no addressing involved during data transfer. The switches route the data based on 


their occupied band (FDM) or time slot (TDM). 


Three Phases 

The actual communication in a circuit-switched network requires three phases: connection setup, 
data transfer, and connection teardown. 

1) Setup Phase 

Before the two parties can communicate, a dedicated circuit needs to be established. The end 
systems are normally connected through dedicated lines to the switches, so connection setup 


means creating dedicated channels between the switches. 
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In the next step to making a connection, an acknowledgment from system M needs to be sent in 
the opposite direction to system A. Only after system A receives this acknowledgment is the 


connection established. 








One link, n channels 


H 








‘If 





2) Data Transfer Phase 


After the establishment of the dedicated circuit (channels), the two parties can transfer data. 


3) Teardown Phase 
When one of the parties needs to disconnect, a signal is sent to each switch to release the 


resources. 


Efficiency 
Circuit-switched networks are not as efficient as the other two types of networks because 
resources are allocated during the entire duration of the connection. These resources are 


unavailable to other connections. 


Delay 
Although a circuit-switched network normally has low efficiency, the delay in this type of 
network is minimal. During data transfer the data are not delayed at each switch; the resources 


are allocated for the duration of the connection. 
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Time Time Time Time 

e As Figure shows, there is no waiting time at each switch. The total delay is due to the time 
needed to create the connection, transfer data, and disconnect the circuit. 

e The delay caused by the setup is the sum of four parts: the propagation time of the source 
computer request, the request signal transfer time, the propagation time of the 
acknowledgment from the destination computer, and the signal transfer time of the 
acknowledgment. 

e The delay due to data transfer is the sum of two parts: the propagation time and data transfer 
time, which can be very long. 


¢ The third box shows the time needed to tear down the circuit. 


3.4 Datagram Networks 


e In data communications, we need to send messages from one end system to another. 

e Ifthe message is going to pass through a packet-switched network, it needs to be divided into 
packets of fixed or variable size. 

e The size of the packet is determined by the network and the governing protocol. 

e In packet switching, there is no resource allocation for a packet. This means that there is no 
reserved bandwidth on the links, and there is no scheduled processing time for each packet. 

e Resources are allocated on demand. The allocation is done on a first come, first-served basis. 
When a switch receives a packet, no matter what is the source or destination, the packet must 


wait if there are other packets being processed. 
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e Ina datagram network, each packet is treated independently of all others. Even if a packet is 
part of a multipacket transmission, the network treats it as though it existed alone. Packets in 
this approach are referred to as datagrams. 

e Datagram switching is normally done at the network layer. 

e The datagram networks are sometimes referred to as connectionless networks. The term 
connectionless here means that the switch (packet switch) does not keep information about 
the connection state. There are no setup or teardown phases. Each packet is treated the same 


by a switch regardless of its source or destination. 


Example: Below Figure shows how the datagram approach is used to deliver four packets from 


station A to station X. The switches in a datagram network are traditionally referred to as routers. 


Datagram network 



















































































In this example, all four packets (or datagrams) belong to the same message, but may travel 
different paths to reach their destination. This is so because the links may be involved in carrying 
packets from other sources and do not have the necessary bandwidth available to carry all the 
packets from A to X. This approach can cause the datagrams of a transmission to arrive at their 
destination out of order with different delays between the packets. Packets may also be lost or 


dropped because of a lack of resources. 


Routing Table 
Each switch (or packet switch) has a routing table which is based on the destination address. The 
routing tables are dynamic and are updated periodically. The destination addresses and the 


corresponding forwarding output ports are recorded in the tables. 
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Destination Address 

e Every packet in a datagram network carries a header that contains the destination address of 
the packet. 

© When the switch receives the packet, this destination address is examined; the routing table is 
consulted to find the corresponding port through which the packet should be forwarded. 

e The destination address in the header of a packet in a datagram network remains the same 


during the entire journey of the packet. 


Efficiency 

The efficiency of a datagram network is better than that of a circuit-switched network; resources 
are allocated only when there are packets to be transferred. If a source sends a packet and there is 
a delay of a few minutes before another packet can be sent, the resources can be reallocated 


during these minutes for other packets from other sources. 


Delay 
There may be greater delay in a datagram network than in a virtual-circuit network. Although 
there are no setup and teardown phases, each packet may experience a wait at a switch before it 
is forwarded. In addition, since not all packets in a message necessarily travel through the same 
switches, the delay is not uniform for the packets of a message. 


Below figure shows an example of delay in a datagram network for one single packet. 








Abhijith H V, Assistant Professor, Dept. of ISE, SVIT, Bangalore Page 35 











Transmission 
time 






Total delay 





Waiting 
time 
Time Time Time Time 


The packet travels through two switches. There are three transmission times (3T), three 
propagation delays (slopes 3 of the lines), and two waiting times (W1 + W2)' We ignore the 
processing time in each switch. The total delay is 


Total delay =3T + 3t + W1 + W2 


3.5 Virtual-Circuit Networks 
A virtual-circuit network is a cross between a circuit-switched network and a datagram network. 
It has some characteristics of both. 

1. As in a circuit-switched network, there are setup and teardown phases in addition to the 
data transfer phase. 

2. Resources can be allocated during the setup phase, as in a circuit-switched network, or on 
demand, as in a datagram network. 

3. As in a datagram network, data are packetized and each packet carries an address in the 
header. However, the address in the header has local jurisdiction, not end-to-end 
jurisdiction. 

4. As ina circuit-switched network, all packets follow the same path established during the 
connection. 

5. A virtual-circuit network is normally implemented in the data link layer, while a circuit- 
switched network is implemented in the physical layer and a datagram network in the 
network layer. 

Below Figure is an example of a virtual-circuit network. The network has switches that allow 
traffic from sources to destinations. A source or destination can be a computer, packet switch, 


bridge, or any other device that connects other networks. 
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Addressing 

In a virtual-circuit network, two types of addressing are involved: global and local (virtual-circuit 
identifier). 

Global Addressing 

A source or a destination needs to have a global address-an address that can be unique in the 
scope of the network or internationally if the network is part of an international network. 
Virtual-Circuit Identifier 

The identifier that is actually used for data transfer is called the virtual-circuit identifier (VCI). A 
VCI is a small number that has only switch scope; it is used by a frame between two switches. 
When a frame arrives at a switch, it has a VCI; when it leaves, it has a different VCI. Below 


Figure shows how the VCI in a data frame changes from one switch to another. 


vcl vcl 








Three Phases 

Data Transfer Phase 

To transfer a frame from a source to its destination, all switches need to have a table entry for 
this virtual circuit. The table, in its simplest form, has four columns. This means that the switch 
holds four pieces of information for each virtual circuit that is already set up. 


Below Figure shows such a switch and its corresponding table. 
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Figure shows a frame arriving at port | with a VCI of 14. When the frame arrives, the switch 
looks in its table to find port 1 and a VCI of 14. When it is found, the switch knows to change the 
VCI to 22 and send out the frame from port 3. 

Below Figure shows how a frame from source A reaches destination B and how its VCI changes 


during the trip. Each switch changes the VCI and routes the frame. 
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The data transfer phase is active until the source sends all its frames to the destination. 

The procedure at the switch is the same for each frame of a message. The process creates a 
virtual circuit, not a real circuit, between the source and destination. 

Setup Phase 

In the setup phase, a switch creates an entry for a virtual circuit. 

For example, suppose source A needs to create a virtual circuit to B. Two steps are required: the 


setup request and the acknowledgment. 
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Setup Request: A setup request frame is sent from the source to the destination. 
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Setup request in a virtual-circuit network 
Source A sends a setup frame to switch 1. 
Switch 1 receives the setup request frame. It knows that a frame going from A to B goes out 
through port 3. The switch, in the setup phase, acts as a packet switch; it has a routing table 
which is different from the switching table. For the moment, assume that it knows the output 
port. The switch creates an entry in its table for this virtual circuit, but it is only able to fill 
three of the four columns. The switch assigns the incoming port (1) and chooses an available 
incoming VCI (14) and the outgoing port (3). It does not yet know the outgoing VCI, which 
will be found during the acknowledgment step. The switch then forwards the frame through 
port 3 to switch 2. 
Switch 2 receives the setup request frame. The same events happen here as at switch 1; three 
columns of the table are completed: in this case, incoming port (1), incoming VCI (66), and 
outgoing port (2). 
Switch 3 receives the setup request frame. Again, three columns are completed: incoming 
port (2), incoming VCI (22), and outgoing port (3). 
Destination B receives the setup frame, and if it is ready to receive frames from A, it assigns 
a VCI to the incoming frames that come from A, in this case 77. This VCI lets the destination 


know that the frames come from A, and not other sources. 


Acknowledgment: A special frame, called the acknowledgment frame, completes the entries in 


the switching tables. 
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Setup acknowledgment in a virtual-circuit network 


The destination sends an acknowledgment to switch 3. The acknowledgment carries the 
global source and destination addresses so the switch knows which entry in the table is to be 
completed. The frame also carries VCI 77, chosen by the destination as the incoming VCI for 
frames from A. Switch 3 uses this VCI to complete the outgoing VCI column for this entry. 
Note that 77 is the incoming VCI for destination B, but the outgoing VCI for switch 3. 
Switch 3 sends an acknowledgment to switch 2 that contains its incoming VCI in the table, 
chosen in the previous step. Switch 2 uses this as the outgoing VCI in the table. 

Switch 2 sends an acknowledgment to switch | that contains its incoming VCI in the table, 
chosen in the previous step. Switch 1 uses this as the outgoing VCI in the table. 

Finally switch 1 sends an acknowledgment to source A that contains its incoming VCI in the 
table, chosen in the previous step. 


e. The source uses this as the outgoing VCI for the data frames to be sent to destination B. 


Teardown Phase 

In this phase, source A, after sending all frames to B, sends a special frame called a teardown 
request. Destination B responds with a teardown confirmation frame. All switches delete the 
corresponding entry from their tables. 

Efficiency 

Resource reservation in a virtual-circuit network can be made during the setup or can be on 


demand during the data transfer phase. 
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In the first case, the delay for each packet is the same; in the second case, each packet may 
encounter different delays. There is one big advantage in a virtual-circuit network even if 
resource allocation is on demand. The source can check the availability of the resources, without 


actually reserving it. 


Delay in Virtual-Circuit Networks 





In a virtual-circuit network, there is a one-time delay for setup and a one-time delay for 


teardown. 
If resources are allocated during the setup phase, there is no wait time for individual packets. 


Below Figure shows the delay for a packet traveling through two switches in a virtual-circuit 


network. 















Total delay 


Teardown 


Time Time Time Time 
The packet is traveling through two switches (routers). There are three transmission times (3T), 
three propagation times (3t), data transfer depicted by the sloping lines, a setup delay (which 
includes transmission and propagation in two directions), and a teardown delay (which includes 
transmission and propagation in one direction). We ignore the processing time in each switch. 


The total delay time is 
Total delay = 3T+ 31 + setup delay + teardown delay 
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Module-3 
DATA LINK LAYER 


Error Detection and Correction 


Data can be corrupted during transmission. Some applications require that errors be detected and 
corrected. 


3.1 Introduction 


Types of Errors 
There are two types of error: Single bit error and Burst error. 
1) Single-Bit Error 


The term single-bit error means that only | bit of a given data unit (such as a byte, character, or 
packet) is changed from | to 0 or from 0 to 1. Single-bit errors are the least likely type of error in 
serial data transmission. 


O changed to 1 





o[o[ofofofo[1[o}—>o][oJo]Jo[iJo]1[o 
Sent Received 



































2) Burst Error 


The term burst error means that 2 or more bits in the data unit have changed from | to 0 or from 
0 to 1. In the below figure, 0100010001000011 was sent, but 0101110101100011 was received. 


Length of burst 
error (8 bits) 
Sent 


of1 [ofofo]ifofofofifofofofofifi 


Corrupted bits 
ofifo ifo ofa ofofofif1 


Received 



























































Note that a burst error does not necessarily mean that the errors occur in consecutive bits. The 
length of the burst is measured from the first corrupted bit to the last corrupted bit. Some bits in 
between may not have been corrupted. 
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Redundancy 


To detect or correct errors some extra bits are sent with data. These redundant bits are added by 


the sender and removed by the receiver. Their presence allows the receiver to detect or correct 


corrupted bits. 


Detection versus Correction 


The correction of errors is more difficult than the detection. 

In error detection, we are looking only to see if any error has occurred. 

In error correction, we need to know the exact number of bits that are corrupted and more 
importantly, their location in the message. The number of the errors and the size of the 
message are important factors. 

If we need to correct one single error in an 8-bit data unit, we need to consider eight possible 
error locations; if we need to correct two errors in a data unit of the same size, we need to 
consider 28 possibilities. 


Forward Error Correction versus Retransmission 


There are two main methods of error correction. 


Forward error correction is the process in which the receiver tries to guess the message by 
using redundant bits. This is possible, if the number of errors is small. 





Correction by retransmission is a technique in which the receiver detects the occurrence of 
an error and asks the sender to resend the message. Resending is repeated until a message 
arrives that the receiver believes is error-free. 


Codin 


Redundancy is achieved through various coding schemes. 

The sender adds redundant bits through a process that creates a relationship between the 
redundant bits and the actual data bits. 

The receiver checks the relationships between the two sets of bits to detect or correct the 
errors. 

Coding schemes can be divided into two broad categories: block coding and convolution 
coding. 
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Sender Receiver 























Message Message 











Correct or 
discard 


Unreliable transmission 





Modular Arithmetic 
In modular arithmetic only integers in the range 0 to N-1 is used. This is known as modulo-N 
arithmetic. 


For example, if the modulus is 12, we use only the integers 0 to 11, inclusive. 


Modulo-2 Arithmetic 

In this arithmetic, the modulus N is 2. We can use only 0 and 1. Operations in this arithmetic are 
very simple. The following shows how we can add or subtract 2 bits. 

Adding: 

0+0=0 O+1=1 1+0=1 1+1=0 

Subtracting: 

0-0=0 0-1=1 1-0=1 1-1=0 

In this arithmetic we use the XOR (exclusive OR) operation for both addition and subtraction. 


The result of an XOR operation is 0 if two bits are the same; the result is I if two bits are 


























different. 
0@o0=0 1@1=0 
3 - 1 0 1 1 0 
a. Two bits are the same, the result is 0. @1 1 1.0 
o 1 ie} 1 
o@1=1 1@0=1 
b. Two bits are different, the result is 1. c. Result of XORing two patterns 
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3.2 Block Coding 


e In block coding message is divided into k bits blocks called datawords. Then + redundant 
bits are added to each block to make the length n = k +r. The resulting n-bit blocks are called 
codewords. 

e With k bits, we can create a combination of 2k datawords; with n bits, we can create a 
combination of 2n codewords. 

e Since n > k, the number of possible codewords is larger than the number of possible 
datawords. 

e The block coding process is one-to-one; the same dataword is always encoded as the same 
codeword. This means that we have 2n - 2k codewords that are not used. We call these 


codewords invalid or illegal. 




















2" Codewords, each of n bits (only 2k of them are valid) 


Error Detection 

If the following two conditions are met, the receiver can detect a change in the original 
codeword. 

1, The receiver has (or can find) a list of valid codewords. 


2. The original codeword has changed to an invalid one. 


Sender Receiver 











Dataword k bits 











Extract 











Generator | Checker 
Discard 


Yv 





Unreliable transmission 





Process of error detection in block coding 
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© The sender creates codewords out of datawords by using a generator that applies the rules 
and procedures of encoding. 

e Each codeword sent to the receiver may change during transmission. 

e If the received codeword is the same as one of the valid codewords, the word is accepted; the 
corresponding dataword is extracted for use. If the received codeword is not valid, it is 
discarded. 

e However, if the codeword is corrupted during transmission but the received word still 
matches a valid codeword, the error remains undetected. 

e This type of coding can detect only single errors. Two or more errors may remain undetected. 

Example: 


Let us assume that k =2 and n =3. Below Table shows the list of datawords and codewords. 

















Dataword Codeword 
00 000 
01 oll 
10 101 
11 110 














Assume the sender encodes the dataword 01 as 011 and sends it to the receiver. Consider the 

following cases: 

1. The receiver receives 011. It is a valid codeword. The receiver extracts the dataword 01 from 
it. 

2. The codeword is corrupted during transmission, and 111 is received (the leftmost bit is 
corrupted). 
This is not a valid codeword and is discarded. 

3. The codeword is corrupted during transmission, and 000 is received (the right two bits are 
corrupted). This is a valid codeword. The receiver incorrectly extracts the dataword 00. Two 


corrupted bits have made the error undetectable. 


Error Correction 
In error detection, the receiver needs to know only that the received codeword is invalid; in error 


correction the receiver needs to find (or guess) the original codeword sent. 
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Sender Receiver 
Encoder 

















Decoder 





k bits] Dataword 











Generator 





Unreliable transmission 








Below Table shows the datawords and codewords. 

















Dataword Codeword 
00 00000 
01 01011 
10 10101 
11 11110 














Assume the dataword is 01. The sender consults the table (or uses an algorithm) to create the 

codeword 01011. 

The codeword is corrupted during transmission, and 01001 is received (error in the second bit 

from the right). 

First, the receiver finds that the received codeword is not in the table. This means an error has 

occurred. (Detection must come before correction.) 

The receiver, assuming that there is only | bit corrupted, uses the following strategy to guess the 

correct dataword. 

1. Comparing the received codeword with the first codeword in the table (01001 versus 00000), 
the receiver decides that the first codeword is not the one that was sent because there are two 
different bits. 

2. By the same reasoning, the original codeword cannot be the third or fourth one in the table. 

3. The original codeword must be the second one in the table because this is the only one that 
differs from the received codeword by | bit. The receiver replaces 01001 with 01011 and 
consults the table to find the dataword 01. 
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Hamming Distance 

The Hamming distance between two words (of the same size) is the number of differences 
between the corresponding bits. Hamming distance between two words x and y is represented as 
d(x, y). The Hamming distance can be found by applying the XOR operation on the two words 
and counting the number of 1s in the result. 

Example: 

1. The Hamming distance d(000, 011) is 2 because 000 ® 011 is 011 (two Is). 

2. The Hamming distance d(10101, 11110) is 3 because 10101 11110 is 01011 (three 1s). 
Minimum Hamming Distance: The minimum Hamming distance is the smallest Hamming 


distance between all possible pairs in a set of words. It is represented as din. 

















Example 1 
Find the minimum Hamming distance of the coding scheme in below table: 
Dataword Codeword 
00 000 
ol oll 
10 101 
11 110 














Solution 
(000,011) =2, d(000,101) = 2, d(000,110) = 2, (011,101) = 2, d(011,110) = 2, d(101,110) =2 


The dyin in this case is 2. 

















Example 2 
Find the minimum Hamming distance of the coding scheme in below table: 
Dataword Codeword 
00 00000 
Ol 01011 
10 10101 
11 11110 














Solution 
d(00000,01011) = 3, d(00000,10101) = 3, d(00000,11110) =4 
d(01011,10101) = 4, d(01011,11110) = 3, d(10101,11110) =3 


The dmin in this case is 3. 
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Coding scheme needs to have at least three parameters: the codeword size n, the dataword size k, 
and the minimum Hamming distance dmin. A coding scheme C is written as C(n, k) with a 
separate expression for dmin. 


Ex: C(5, 2) with dmin = 3. 


Hamming Distance and Error 

e When a codeword is corrupted during transmission, the Hamming distance between the sent 
and received codewords is the number of bits affected by the error. 

e The Hamming distance between the received codeword and the sent codeword is the number 
of bits that are corrupted during transmission. 

e For example, if the codeword 00000 is sent and 01101 is received, 3 bits are in error and the 


Hamming distance between the two is d(00000, 01101) =3. 


Minimum Distance for Error Detection 

e If S errors occur during transmission, the Hamming distance between the sent codeword and 
received codeword is S. 

e To guarantee the detection of up to S errors in all cases, the minimum Hamming distance in a 
block code must be dmin =S + 1. 

¢ Let us assume that the sent codeword x is at the center of a circle with radius S. All other 
received codewords that are created by | to S errors are points inside the circle or on the 


perimeter of the circle. All other valid codewords must be outside the circle. 





Minimum Distance for Error Correction 
e When a received codeword is not a valid codeword, the receiver needs to decide which valid 
codeword was actually sent. The decision is based on the concept of territory, an exclusive 


area surrounding the codeword. Each valid codeword has its own territory. 
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© We use a geometric approach to define each territory. We assume that each valid codeword 
has a circular territory with a radius of ¢ and that the valid codeword is at the center. 

e For example, suppose a codeword x is corrupted by f bits or less. Then this corrupted 
codeword is located either inside or on the perimeter of this circle. If the receiver receives a 
codeword that belongs to this territory, it decides that the original codeword is the one at the 
center. 

e To guarantee correction of up to t errors in all cases, the minimum Hamming distance in a 


block code must be dmin == 2t + 1. 


Territory of x Territory of y 





3.3 Linear Block Codes 


Linear block code is a code in which the exclusive OR of two valid codewords creates another 
valid codeword. 
Minimum Distance for Linear Block Codes: The minimum Hamming distance is the number 


of Is in the nonzero valid codeword with the smallest number of Is. 


Some Linear Block Codes 


1) Simple Parity-Check Code 


e In this code, a A-bit dataword is changed to an n-bit codeword where n = k + 1. The extra bit, 
called the parity bit, is selected to make the total number of Is in the codeword even. 


© A simple parity-check code is a single-bit error-detecting code in which n =k + 1 with dmin =2. 
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Sender Receiver 
Encoder Decoder 
Dataword Dataword 
fealea]2s]a0) [22] ]20 
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syndrome 








Generator 

















Unreliable 
transmission 

















Codeword Codeword 











e The encoder uses a generator that takes a copy of a 4-bit dataword (a, a), a2, and a3) and 
generates a parity bit Ip. 


e The dataword bits and the parity bit create the 5-bit codeword. The parity bit that is added 
makes the number of Is in the codeword even. 


Example: Simple parity-check code C(5, 4) 





























Datawords Codewords Datawords Codewords 
0000 00000 000 10001 
0001 00011 001 10010 
0010 00101 010 10100 
0011 00110 Oll 10111 
0100 01001 100 11000 
OLOL 01010 101 11011 
O110 01100 110 11101 
OL1L O1l11 111 11110 























e This is normally done by adding the 4 bits of the dataword (modulo-2); the result is the parity 


bit. In other words, 





To=a3t+ayta;tag (modulo — 2) 
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e Ifthe number of Is is even, the result is 0; if the number of 1s is odd, the result is 1. In both 
cases, the total number of Is in the codeword is even. 

e The sender sends the codeword which may be corrupted during transmission. 

e The receiver receives a 5-bit word. 

e The checker at the receiver does the same thing as the generator in the sender with one 
exception: The addition is done over all 5 bits. 

e = The result, which is called the syndrome, is just 1 bit. The syndrome is 0 when the number of 


1s in the received codeword is even; otherwise, it is 1. 





So=b3+by+b,+botqo (modulo — 2) 





e The syndrome is passed to the decision logic analyzer. 

e If the syndrome is 0, there is no error in the received codeword; the data portion of the 
received codeword is accepted as the dataword. 

e Ifthe syndrome is 1, the data portion of the received codeword is discarded. The dataword is 


not created. 








Example: 

Assume the sender sends the dataword 1011. The codeword created from this dataword is 10111, 
which is sent to the receiver. 

1. No error occurs; the received codeword is 10111. The syndrome is O. The dataword 1011 is 


created. 


Nv 


One single-bit error changes a; The received codeword is 10011. The syndrome is 1. No 

dataword is created. 

3. One single-bit error changes rp The received codeword is 10110. The syndrome is 1. No 
dataword is created. Note that although none of the dataword bits are corrupted, no dataword 
is created because the code is not sophisticated enough to show the position of the corrupted 
bit. 

4. An error changes ro and a second error changes a; The received codeword is 00110. The 

syndrome is 0. The dataword 0011 is created at the receiver. Note that here the dataword is 

wrongly created due to the syndrome value. The simple parity-check decoder cannot detect 
an even number of errors. The errors cancel each other out and give the syndrome a value of 


0. 
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5. Three bits-a3, a2, and a; are changed by errors. The received codeword is 01011. The 
syndrome is 1. The dataword is not created. This shows that the simple parity check, 


guaranteed to detect one single error, can also find any odd number of errors. 





Limitation: A simple parity-check code can detect an odd number of errors. 


e A better approach is the two-dimensional parity check. In this method, the dataword is 
organized in a table. 

e The data to be sent, five 7-bit bytes, are put in separate rows. 

e For each row and each column, | parity-check bit is calculated. 

e The whole table is then sent to the receiver, which finds the syndrome for each row and each 
column. 

e The two-dimensional parity check can detect up to three errors that occur anywhere in the 


table. However, errors affecting 4 bits may not be detected. 
































z 
Column parities 
a. Design of row and column parities 
~< 
'b. One error affects two parities € Two errors affect two parities 























d. Three errors affect four parities e. Four errors cannot be detected 
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Hamming Codes 


These codes were originally designed with dmin = 3, which means that they can detect up to two 
errors or correct one single error. 

In hamming code we need to choose an integer m, say m >= 3. The values of n and k are then 
calculated from m as n = 2m — 1 and k =n - m. The number of check bits r =m. 

Eg: ifm = 3, n=7,k=4 

Hamming code C(7, 4) - n=7, k = 4: 


























Datawords Codewords Datawords Codewords 
0000 0000000, 000 1000110 
0001 OOOLIOL 001 1001011 
0010 0010111 010 1010001 
OOLI 0011010 O11 1011100 
0100 0100011 100 1100101 
O101 OLOLLLO 101 1101000 
0110 0110100 110 1110010 
OLLI 0111001 1 Haiti 


























Below figure shows the structure of the encoder and decoder: 
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A copy of a 4-bit dataword is fed into the generator that creates three parity checks. 
To=az+a;+ay modulo-2 
ry =a3+a,+a, modulo-2 


r,=a,+ag+az  modulo-2 


The checker in the decoder creates a 3-bit syndrome (s2s1s0) in which each bit is the parity 
check for 4 out of the 7 bits in the received codeword: 

So=bz+bi+bp modulo-2 

s1=b3+b2+b; —modulo-2 

s2=bit+bo+b3 modulo-2 


The 3-bit syndrome creates eight different bit patterns (000 to 111) that can represent eight 
different conditions. These conditions define a lack of error or an error in | of the 7 bits of the 


received codeword. 





Syndrome 000 001 010 oll 100 101 110 111 








Error None qo qi bp qe bo bs b 





























For example, if qo is in error, So is the only bit affected; the syndrome, therefore, is 001. If b2 is 
in error, So and S; are the bits affected; the syndrome therefore is 011. Similarly, if b; is in error, 


all 3 syndrome bits are affected and the syndrome is 111. 








Example: 

1. The dataword 0100 becomes the codeword 0100011. The codeword 0100011 is received. 
The syndrome is 000 (no error), the final dataword is 0100. 

The dataword 0111 becomes the codeword 0111001. The codeword 0011001 is received. 


Nv 


The syndrome is 011. Therefore b2 is in error. After flipping 52 (changing the 1 to 0), the 
final dataword is 0111. 

3. The dataword 1101 becomes the codeword 1101000. The codeword 0001000 is received 
(two errors). The syndrome is 101, which means that bp is in error. After flipping bp, we get 


0000, the wrong dataword. This shows that our code cannot correct two errors. 
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Performance 
A Hamming code can only correct a single error or detect a double error. However, there is a 


way to make it detect a burst error. 


Sender Receiver 












Codeword 4 









Codeword 3 






codeword 2 





Codeword 1 [i 





Burst error 






































Corrupted bits 


The key is to split a burst error between several codewords, one error for each codeword. 

To make the Hamming code respond to a burst error of size N, we need to make N codewords 
out of our frame. Then, instead of sending one codeword at a time, we arrange the codewords in 
a table and send the bits in the table a column at a time. 

In the above Figure, the bits are sent column by column (from the left). In each column, the bits 
are sent from the bottom to the top. In this way, a frame is made out of the four codewords and 
sent to the receiver. It is shown in the figure that when a burst error of size 4 corrupts the frame, 
only 1 bit from each codeword is corrupted. The corrupted bit in each codeword can then easily 


be corrected at the receiver. 


3.4 Cyclic Codes 


Cyclic codes are special linear block codes in which, if a codeword is cyclically shifted (rotated), 
the result is another codeword. 

For example, if 1011000 is a codeword and we cyclically left-shift, then 0110001 is also a 
codeword. 

In this case, if we call the bits in the first word ap to as and the bits in the second word hy to bg, 
we can shift the bits by using the following: 


bi=ay by=a; b3=ay by=az bs=ay be=as bo =ae 
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Cyclic Redundancy Check 
Sender Receiver 
Encoder Decoder 








Dataword 
a3]42]@1]0] 


Dataword 























Accept 


Divisor 











Remainder 


Unreliable 
transmission 



































Codeword Codeword 


Below Table shows an example of a CRC code. 





























Dataword Codeword Dataword Codeword 
0000 QO000000, 1000 1000101 
0001 OOOLO11 1001 1001110 
0010 0010110 1010 1010011 
0011 0011101 1011 1011000 
0100 0100111 1100 1100010 
0101 0101100 1101 1101001 
O110 0110001 1110 1110100 
O11 OL11010 111 1111111 




















e In the encoder, the dataword has k bits (4 here); the codeword has n bits (7 here). The size of 
the dataword is augmented by adding n - k (3 here) Os to the right-hand side of the word. The 


n-bit result is fed into the generator. 
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e The generator uses a divisor of size n - k + 1 (4 here), predefined and agreed upon. The 
generator divides the augmented dataword by the divisor (modulo-2 division). 

e The quotient of the division is discarded; the remainder is appended to the dataword to create 
the codeword. 

e The decoder receives the possibly corrupted codeword. A copy of all n bits is fed to the 
checker which is a replica of the generator. 

© The remainder produced by the checker is a syndrome of n - k (3 here) bits, which is fed to 
the decision logic analyzer. 

e The analyzer has a simple function. If the syndrome bits are all as, the 4 leftmost bits of the 
codeword are accepted as the dataword (interpreted as no error); otherwise, the 4 bits are 


discarded (error). 


Encoder 
The encoder takes the dataword and augments it with n - k number of Os. It then divides the 


augmented dataword by the divisor. 





Dataword|1_0 O 1 











Division { 





Quotient 


1010 
Divisor 101 1 1001{0 ~— [rere 
oS. 











Codeword 
Dataword Remainder 
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Decoder 

The codeword can change during transmission. The decoder does the same division process as 
the encoder. The remainder of the division is the syndrome. If the syndrome is all 0s, there is no 
error; the dataword is separated from the received codeword and accepted. Otherwise, everything 
is discarded. 

The left hand figure shows the value of syndrome when no error has occurred; the syndrome is 
000. The right-hand part of the figure shows the case in which there is one single error. The 


syndrome is not all Os 


codeword [TOOT TO] codeword [OOO TO] 
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Hardware Implementation 

One of the advantages of a cyclic code is that the encoder and decoder can easily and cheaply be 
implemented in hardware by using a handful of electronic devices. Also, a hardware 
implementation increases the rate of check bit and syndrome bit calculation. 

Divisor: 

1. The divisor is repeatedly XORed with part of the dividend. 

2. The divisor has n - k + 1 bits which either are predefined or are all Os. In other words, the 


bits do not change from one dataword to another. In previous example, the divisor bits were 
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either 1011 or 0000. The choice was based on the leftmost bit of the part of the augmented 
data bits that are active in the XOR operation. 

3. A close look shows that only n - k bits of the divisor is needed in the XOR operation. The 
leftmost bit is not needed because the result of the operation is always 0, no matter what the 
value of this bit. The reason is that the inputs to this XOR operation are either both Os or both 
Is. 





a aa! dq, do 


1. Assume that the remainder is originally all Os (000 in our example). 


XOR XOR 


Steps: 


2. At each time click (arrival of 1 bit from an augmented dataword), repeat the following two 
actions: 
a. Use the leftmost bit to make a decision about the divisor (011 or 000). 
b. The other 2 bits of the remainder and the next bit from the augmented dataword (total of 


3 bits) are XORed with the 3-bit divisor to create the next remainder. 


Below Figure shows this simulator, but note that this is not the final design; there will be more 


improvements. 
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At each clock tick, shown as different times, one of the bits from the augmented dataword is 
used in the XOR process. 

The above design is for demonstration purposes only. It needs simplification to be practical. 
First, we do not need to keep the intermediate values of the remainder bits; we need only the 
final bits. We therefore need only 3 registers instead of 24. After the XOR operations, we do not 
need the bit values of the previous remainder. Also, we do not need 21 XOR devices; two are 
enough because the output of an XOR operation in which one of the bits is 0 is simply the value 
of the other bit. This other bit can be used as the output. With these two modifications, the design 


becomes tremendously simpler and less expensive, as shown below 


Augmented dataword 
10 0 1 0 0 0 
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General Design 
Note: 








ditt qy do 
eee Dataword 
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Trek-1 
a. Encoder 
=. a ee: 
In-kt a 2 Received 
an codeword 
Sr-k-1 Sy So 
b. Decoder 


Polynomials 


A pattern of Os and 1s can be represented as a polynomial with coefficients of 0 and 1. The 
power of each term shows the position of the bit; the coefficient shows the value of the bit. 


Figure shows a binary pattern and its polynomial representation. 




















a as ay a; ay a ay 
a. Binary pattern and polynomial b. Short form 
Degree of a Polynomial 


The degree of a polynomial is the highest power in the polynomial. 
For example, the degree of the polynomial x6 + x + 1 is 6. Note that the degree of a polynomial 


is 1 less that the number of bits in the pattern. The bit pattern in this case has 7 bits. 
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Adding and Subtracting Polynomials 

Adding and subtracting polynomials in mathematics are done by adding or subtracting the 
coefficients of terms with the same power. In our case, the coefficients are only 0 and 1, and 
adding is in modulo-2. This has two consequences. First, addition and subtraction are the same. 
Second, adding or subtracting is done by combining terms and deleting pairs of identical terms. 
For example, adding x5 + x4 + x2 and x6 + x4 + x2 gives just x6 + x5. The terms x4 and x2 are 
deleted. However, note that if we add, for example, three polynomials and we get x2 three times, 


we delete a pair of them and keep the third. 


Multiplying or Dividing Terms 
In this arithmetic, multiplying a term by another term is very simple; we just add the powers. 
For example, x3 x x4 is x7 , For dividing, we just subtract the power of the second term from the 


power of the first. 


Multiplying Two Polynomials 

Multiplying a polynomial by another is done term by term. Each term of the first polynomial 
must be multiplied by all terms of the second. The result, of course, is then simplified, and pairs 
of equal terms are deleted. The following is an example: 


(+x? + x +x4 1) 


ax'terPecdeate rd 


7 5 3 
=x! +? +x7 +x 


txtere rer eren 


Dividing One Polynomial by Another 
We divide the first term of the dividend by the first term of the divisor to get the first term of the 
quotient. We multiply the term in the quotient by the divisor and subtract the result from the 


dividend. We repeat the process until the dividend degree is less than the divisor degree. 


Shifting 
A binary pattern is often shifted a number of bits to the right or left. Shifting to the left means 
adding extra Os as rightmost bits; shifting to the right means deleting some rightmost bits. 


Shifting to the left is accomplished by multiplying each term of the polynomial by xn, where m 
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is the number of shifted bits; shifting to the right is accomplished by dividing each term of the 
polynomial by x". The following shows shifting to the left and to the right. Note that we do not 
have negative powers in the polynomial representation. 

4 


Shifting left 3 bits: 10011 becomes 10011000 x* +x +1 becomes x” +x4 +27 


Shifting right 3 bits: 10011 becomes 10 x4 +x +1 becomes x 


bataword| 5327 | 


Divisor x3 + x 
~~ +x +1 xe + x 
xo + x4 + x3 


x4 





xt + x2 + x 





Dataword Remainder 
Cyclic Code Analysis 


Following notations can be used in the cyclic codes: 


Dataword: d(x) Syndrome: s(x) Codeword: c(x) 


Error: e(x) Generator: g(x) 


Ina cyclic code, 
1. If s(x) !=0, one or more bits is corrupted. 
2. Ifs(x) =0, either 

a. No bit is corrupted. or 


b. Some bits are corrupted, but the decoder failed to detect them. 


The received codeword is the sum of the sent codeword and the error. 
Received codeword =c(x) + e(x) 


The receiver divides the received codeword by g(x) to get the syndrome. 
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Received codeword = ¢(%) + e(x) 
g(x) 2(x) = g(x) 
The Right hand side of above equation is called as syndrome. 
If Syndrome does not have a remainder (syndrome =0), either e(x) is 0 or e(x) is divisible by 
g(x). 


Ina cyclic code, those e(x) errors that are divisible by g(x) are not caught. 


Single-Bit Error 

A single-bit error is e(x) =x’, where i is the position of the bit. If a single-bit error is caught, then 
x is not divisible by g(x). 

If the generator has more than one term and the coefficient of x° is 1, all single errors can be 


caught. 


Example: 
Which of the following g(x) values guarantees that a single-bit error is caught? For each case, 


what is the error that cannot be caught? 


a x1 
b x3 

c 1 
Solution: 


a. No x’ can be divisible by x + 1. In other words, x'/x + 1) always has a remainder. So the 
syndrome is nonzero. Any single-bit error can be caught. 

b. If i is equal to or greater than 3, x’ is divisible by g(x). The remainder of x'/x" is zero, and 
the receiver is fooled into believing that there is no error, although there might be one. All 
single-bit errors in positions | to 3 are caught. 


c. All values of i make j divisible by g(x). No single-bit error can be caught 


Two Isolated Single-Bit Errors 
Two isolated single bit errors can be represented as e(x) = x’ +x’. The values of i and j define the 


positions of the errors, and the difference j - i defines the distance between the two errors. 
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Difference: 


[ols fofafiftfofi 
xi 


AT 


























1Tofofofo][ifa 
xi 


xe 
We can write e(x) =x' (/-'+ 1)=x' @'+ 1). 
If a generator cannot divide x‘ + 1 (¢ between 0 and n - 1), then all isolated double errors can be 


detected. 


Example: 

Find the status of the following generators related to two isolated, single-bit errors. 

a xt] 

b x41 

ce x tx°+1 

dx tx 47 

Solution: 

a. This is a very poor choice for a generator. Any two errors next to each other cannot be 
detected. 

b. This generator cannot detect two errors that are four positions apart. The two errors can be 
anywhere, but if their distance is 4, they remain undetected. 

c. This is a good choice for this purpose. 

d. This polynomial cannot divide any error of type x’ + 1 if ¢ is less than 32,768. This means 
that a codeword with two isolated errors that are next to each other or up to 32,768 bits 


apart can be detected by this generator. 


Odd Numbers of Errors 


A generator that contains a factor of x + 1 can detect all odd-numbered errors. 


Burst Errors 
If L is the burst size and r is the degree of generator polynomial. 
e All burst errors with L <= r will be detected. 


All burst errors with L=r+ 1 will be detected with probability 1 - (1/2)"" 


¢ All burst errors with L > r+ I will be detected with probability 1- (1/2)' 
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Example: 

Find the suitability of the following generators in relation to burst errors of different lengths. 

a x°+1 

b. x'8tx74+x41 

ec. xx x74] 

Solution: 

a. This generator can detect all burst errors with a length less than or equal to 6 bits; 3 out of 
100 burst errors with length 7 will slip by; 16 out of 1000 burst errors of length 8 or more 
will slip by. 

b. This generator can detect all burst errors with a length less than or equal to 18 bits; 8 out of 
1 million burst errors with length 19 will slip by; 4 out of I million burst errors of length 20 
or more will slip by. 

c. This generator can detect all burst errors with a length less than or equal to 32 bits; 5 out of 
10 billion burst errors with length 33 will slip by; 3 out of 10 billion burst errors of length 
34 or more will slip by. 


A good polynomial generator needs to have the following characteristics: 
1. It should have at least two terms. 
2. The coefficient of the term x° should be 1. 
3. It should not divide x‘+ 1, for ¢ between 2 and n - 1. 
4. 


It should have the factor x + 1. 


Standard Polynomials 

















Name Polynomial Application 

CRC-8 xs ext 4 xt ATM header 

CRC-10 194 9 4 Pt xbox? $1 ATM AAL 

CRC-16 x4 IP 4 41 HDLC 

CRC-32 | x2 x76 4 PS 4 22 4 IO 4, IZ ell 4g IO LANs 
Bex7T+ rc taxteartxt 

















Advantages of Cyclic Codes 
e Cyclic codes have a very good performance in detecting single-bit errors, double errors, an 


odd number of errors, and burst errors. 





Abhijith H V, Assistant Professor, Dept. of ISE, SVIT, Bangalore Page 26 


Data Communication Module 3: Data Link Layer 





© They can easily be implemented in hardware and software. 


e They are especially fast when implemented in hardware. 


3.5 Checksum 


The checksum is used in the Internet by several protocols. The checksum is based on the concept 
of redundancy. 

Below Figure shows the process at the sender and at the receiver. The sender initializes the 
checksum to 0 and adds all data items and the checksum (the checksum is considered as one data 
item and is shown in color). The result is 36. However, 36 cannot be expressed in 4 bits. The 
extra two bits are wrapped and added with the sum to create the wrapped sum value 6. In the 
figure, we have shown the details in binary. The sum is then complemented, resulting in the 
checksum value 9 (15 - 6 = 9). The sender now sends six data items to the receiver including the 
checksum 9. The receiver follows the same procedure as the sender. It adds all data items 
(including the checksum); the result is 45. The sum is wrapped and becomes 15. The wrapped 
sum is complemented and becomes 0. Since the value of the checksum is 0, this means that the 
data is not corrupted. The receiver drops the checksum and keeps the other data items. If the 
checksum is not zero, the entire packet is dropped. 


Sender site Receiver site 


Sum —> 36 Sum —> 45, 


Wrapped sum —> 6 Packet Wrapped sum ——> 15 
Checksum —> 9 Checksum —> 0 





Details of wrapping Details of wrapping 
and complementing and complementing 


Internet Checksum 


Traditionally, the Internet has been using a 16-bit checksum. 
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Sender site: 


1. The message is divided into 16-bit words. 

2. The value of the checksum word is set to 0. 

3. All words including the checksum are added using one's complement addition. 
4, The sum is complemented and becomes the checksum. 

5. The checksum is sent with the data. 

Receiver site: 


1. The message (including checksum) is divided into 16-bit words. 

2. All words are added using one's complement addition. 

3. The sum is complemented and becomes the new checksum. 

4. If the value of checksum is 0, the message is accepted; otherwise, it is rejected. 























Example: 
1 O! Te? Carries 109 2 Carries 
F (Fo) 466 F (Fo) 
7 (ro) 7267 (ro) 
A (uz) v Ae a Jame a (uz) 
E (an) 616€E (an) 
0 Checksum (initial) 7 0 4 0 — Checksum (received) 
E Sum (partial) FF F E Sum (partial) 
——[+ Tt. st. 
8 F BF Sum FOF FF Sum 
© 4 0 Checksum (to send) 0 0 0 0 _— Checksum (new) 
a. Checksum at the sender site a. Checksum at the receiver site 
Performance 


The traditional checksum uses a small number of bits (16) to detect errors in a message of any 
size (sometimes thousands of bits). However, it is not as strong as the CRC in error-checking 
capability. For example, if the value of one word is incremented and the value of another word is 
decremented by the same amount, the two errors cannot be detected because the sum and 
checksum remain the same. Also if the values of several words are incremented but the total 
change is a multiple of 65535, the sum and the checksum does not change, which means the 


errors are not detected. 
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3.6 Framing 

e Data transmission in the physical layer means moving bits in the form of a signal from the 
source to the destination. 

e The physical layer provides bit synchronization to ensure that the sender and receiver use the 
same bit durations and timing. 

e The data link layer, on the other hand, needs to pack bits into frames, so that each frame is 
distinguishable from another. 

e Framing in the data link layer separates a message from one source to a destination, or from 
other messages to other destinations, by adding a sender address and a destination address. 

e The destination address defines where the packet is to go; the sender address helps the 
recipient acknowledge the receipt. 


e Frames can be of fixed or variable size. 


1) Fixed-Size Framing 

In fixed-size framing, there is no need for defining the boundaries of the frames; the size itself 

can be used as a delimiter. 

2) Variable-Size Framing 

e In variable-size framing, we need a way to define the end of the frame and the beginning of 
the next. 

e Historically, two approaches were used for this purpose: a character-oriented approach and a 


bit-oriented approach. 


Character-Oriented Protocols 
e Ina character-oriented protocol, data to be carried are 8-bit characters from a coding system 
such as ASCII. 


e The header, which normally carries the source and destination addresses and other control 
information, and the trailer, which carries error detection or error correction redundant bits, 


are also multiples of 8 bits. 
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e To separate one frame from the next, an 8-bit (1-byte) flag is added at the beginning and the 
end of a frame. The flag, composed of protocol-dependent special characters, signals the start 


or end of a frame. 


Data from upper layer 





Variable number of characters | 





Header eee Trailer 


e Flag should be different from the text pattern used in the data section. Otherwise, when the 
receiver encounters this pattern in the middle of the data, it thinks that it has reached the end 
of the frame. To fix this problem, a byte-stuffing strategy was added to character-oriented 
framing. 

e Byte stuffing is the process of adding 1 extra byte whenever there is a flag or escape 
character in the text. 

e The data section is stuffed with an extra byte. This byte is usually called the escape character 
(ESC), which has a predefined bit pattern. Whenever the receiver encounters the ESC 
character, it removes it from the data section and treats the next character as data, not a 
delimiting flag. 

e If even the escape character is also part of the text, an extra escape character is added to show 


that the second one is part of the text. 


Data from upper layer 
Fea [ | ESC 








Frame sent Stuffed 


Yy 
































Flag Trailer 
Extra 2 
bytes 
Frame received 
Header Esc | Flag esc | ESC Trailer 
Unstuffed | 
[ Flag [ [esc 





Data to upper layer 
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e  Character-oriented protocols present another problem in data communications. The universal 
coding systems in use today, such as Unicode, have 16-bit and 32-bit characters that conflict 


with 8-bit characters. 


Bit-Oriented Protocols: 
e Ina bit-oriented protocol, the data section of a frame is a sequence of bits to be interpreted by 
the upper layer as text, graphic, audio, video, and so on. 
e In addition to headers a special 8-bit pattern flag 01111110 is used as the delimiter to define 
the beginning and the end of the frame. 
Data from upper layer 
ble number of bits 








Flag Flag 


e If the flag pattern appears in the data, bit stuffing is used to differentiate the flag from 
information. 

e Bit stuffing is the process of adding one extra 0 whenever five consecutive 1s follow a 0 in 
the data, so that the receiver does not mistake the pattern 0111110 for a flag. 


e This extra stuffed bit is eventually removed from the data by the receiver. 


Data from upper 











Frame sent Stutted 











Frame received 





Unstuffed 


3.7 Flow and Error Control 
Flow Control 
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e Flow control coordinates the amount of data that can be sent before receiving an 
acknowledgment. 

e Flow control refers to a set of procedures used to restrict the amount of data that the sender 
can send before waiting for acknowledgment. 

© The flow of data must not be allowed to overwhelm the receiver. 

e Any receiving device has a limited speed at which it can process incoming data and a limited 
amount of memory in which to store incoming data. 

© The receiving device must be able to inform the sending device before those limits are 
reached and to request that the transmitting device send fewer frames or stop temporarily. 

e Incoming data must be checked and processed before they can be used. The rate of such 
processing is often slower than the rate of transmission. For this reason, each receiving 
device has a block of memory, called a buffer, reserved for storing incoming data until they 
are processed. If the buffer begins to fill up, the receiver must be able to tell the sender to halt 


transmission until it is once again able to receive. 


Error Control 

¢ Error control is both error detection and error correction. 

e It allows the receiver to inform the sender of any frames lost or damaged in transmission and 
coordinates the retransmission of those frames by the sender. 


e In the data link layer, the term error control refers primarily to methods of error detection 


and retransmission. 
e Error control in the data link layer is based on automatic repeat request, which is the 


retransmission of data. 


3.8 Protocols 


The protocols are normally implemented in software by using one of the common programming 


languages. 
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Protocols | 


Simplest Stop-and-Wait ARQ 
Go-Back-N ARQ 












Stop-and-Wait 
Selective Repeat ARQ 

In a real-life network, the data link protocols are implemented as bidirectional; data flow in both 
directions. In these protocols the flow and error control information such as ACKs and NAKs is 


included in the data frames in a technique called piggybacking. 


3.9 Noiseless Channels 


Simplest Protocol 

Simplest protocol is one that has no flow or error control. 

It is a unidirectional protocol in which data frames are traveling in only one direction-from the 

sender to receiver. 

Design 

e The data link layer at the sender site gets data from its network layer, makes a frame out of 
the data, and sends it. 

e The data link layer at the receiver site receives a frame from its physical layer, extracts data 
from the frame, and delivers the data to its network layer. 

e The data link layers of the sender and receiver provide transmission services for their 
network layers. 

e The data link layers use the services provided by their physical layers (such as signaling, 
multiplexing, and so on) for the physical transmission of bits. 

e The sender site cannot send a frame until its network layer has a data packet to send. The 


receiver site cannot deliver a data packet to its network layer until a frame arrives. 
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Receiver 





Network] Get data Deliver data | Network 


Data link Data link 


Pree 
Data frames —> 
ee SD 







Physical 





Request from 


Event:| network layer 











Repeat forever Repeat forever 








Notification from 


Event: 






































physical layer 
Sender Side Algorithm 
1 While(true) // Repeat forever 
2 i 
3 WaitForEvent (); // Sleep until an event occurs 
4 if (Event (RequestToSend) ) //There is a packet to send 
5 ff 
6 GetData(); 
7 MakeFrame (); 
8 SendFrame (); //Send the frame 
9 ii 
10 
Receiver Side Algorithm 
1 While (true) // Repeat forever 
2 if 
3 WaitForEvent (); // Sleep until an event occurs 
4 | if(Event (ArrivalNotification)) //Data frame arrived 
5 i 
6 ReceiveFrame() ; 
7 ExtractData(); 
8 DeliverData(); //Deliver data to network layer 
a: 
10 |} 
Example: 
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Figure shows an example of communication using this protocol. It is very simple. The sender 
sends a sequence of frames without even thinking about the receiver. To send three frames, three 


events occur at the sender site and three events at the receiver site. 


Sender Receiver 





Request ——| Frame = = = 
a Arrival 


Request ——>| Frame 1 
E 


i —S + Arrival 
1 

















Request ——_>] Frame ' 
Arrival 
1 
Y Y 
Time Time 





Stop-and-Wait Protocol 

e If data frames arrive at the receiver site faster than they can be processed, the frames must be 
stored until their use. 

e Normally, the receiver does not have enough storage space, especially if it is receiving data 
from many sources. This may result in either the discarding of frames or denial of service. 

© To prevent the receiver from becoming overwhelmed with frames, we somehow need to tell 
the sender to slow down. There must be feedback from the receiver to the sender. 

e The protocol is called the Stop-and-Wait Protocol because the sender sends one frame, stops 


until it receives confirmation from the receiver, and then sends the next frame. 


Design 
At any time, there is either one data frame on the forward channel or one ACK frame on the 


reverse channel. We therefore need a half-duplex link. 
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Network 


Data link 


Physical| Receive Send Receive Send | physical 


Event: 


Repeat forever 


Event: 





Receiver 


Deliver 
data 










frame frame frame frame 


~<— BACK frame 








Request from 
network layer 











Repeat forever 








Notification from Event: | Notification from 
physical layer “| physical layer 




















Sender Side Algorithm 








hile (true) 7/Repeat forever 
jcanSend = true //Allow the first frame to go 
{ 
WaitForEvent (); // Sleep until an event occurg 
if (Event (RequestToSend) AND canSend) 
{ 
GetData(); 
MakeFrame(); 
SendFrame(); //Send the data frame 
canSend = false; //Cannot send until ACK arrives 
+ 
WaitForEvent (); // Sleep until an event occurs 
if (Event (ArrivalNotification) // An ACK has arrived 
{ 
ReceiveFrame(); //Receive the ACK frame 
canSend = true; 
i 
+ 











Receiver Side Algorithm 








‘h HV, Assistant Professor, Dept. of ISE, SVIT, Bangalore Page 36 


Data Communication Module 3: Data Link Layer 








hile (true) //Repeat forever 


WaitForEvent (); // Sleep until an event occurg 
if (Event (ArrivalNotification)) //Data frame arrives 


ReceiveFrame(); 
ExtractData(); 

















Deliver (data) ; //Deliver data to network layer 
SendFrame() ; //Send an ACK frame 
10 | } 
11 } 
Example: 


The sender sends one frame and waits for feedback from the receiver. When the ACK arrives, 


the sender sends the next frame. 


Sender Receiver 








Request 
' Arrival 
' 
Arrival 1 
1 
1 
Request 1 
' Arrival 
' 
Arrival : 
Y : Y 
Time Time 
3.10 HDLC 


High-level Data Link Control (HDLC) is a bit-oriented protocol for communication over point- 


to-point and multipoint links. It implements the ARQ mechanisms. 


Configurations and Transfer Modes 





HDLC provides two common transfer modes that can be used in different configurations: 
1. Normal response mode (NRM) 
2. Asynchronous balanced mode (ABM) 
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Normal Response Mode: 

e In normal response mode (NRM), the station configuration is unbalanced. We have one 
primary station and multiple secondary stations. 

¢ A primary station can send commands; a secondary station can only respond. 


e The NRM is used for both point-to-point and multiple-point links 


























Primary Secondary 
Secondary Secondary 


























b. Multipoint 


Asynchronous Balanced Mode 
In asynchronous balanced mode (ABM), the configuration is balanced. The link is point-to-point, 


and each station can function as a primary and a secondary. 


Combined Combined 











Frames 


e HDLC defines three types of frames: 
1. Information frames (I-frames) 
2. Supervisory frames (S-frames) 
3. Unnumbered frames (U-frames) 
e Each type of frame serves as an envelope for the transmission of a different type of message. 
e [-frames are used to transport user data and control information relating to user data 
(piggybacking). 


e S-frames are used only to transport control information. 
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© U-frames are reserved for system management. Information carried by U-frames is intended 


for managing the link itself. 


Frame Format 


I-frame 


U-frame 





Fields 

e Flag field: The flag field of an HDLC frame is an 8-bit sequence with the bit pattern 
01111110 that identifies both the beginning and the end of a frame and serves as a 
synchronization pattern for the receiver. 

e Address field: The second field of an HDLC frame contains the address of the secondary 
station. If a primary station created the frame, it contains a to address. If a secondary creates 
the frame, it contains a from address. An address field can be | byte or several bytes long, 
depending on the needs of the network. One byte can identify up to 128 stations (1 bit is used 
for another purpose). Larger networks require multiple-byte address fields. If the address 
field is only 1 byte, the last bit is always a 1. If the address is more than 1 byte, all bytes but 
the last one will end with 0; only the last will end with 1. Ending each intermediate byte with 
0 indicates to the receiver that there are more address bytes to come. 

© Control field: The control field is a 1- or 2-byte segment of the frame used for flow and 
error control. The interpretation of bits in this field depends on the frame type. 

e Information field: The information field contains the user's data from the network layer or 
management information. Its length can vary from one network to another. 

e FCS field: The frame check sequence (FCS) is the HDLC error detection field. It can contain 
either a 2- or 4-byte ITU-T CRC. 
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Control Field 


The control field determines the type of frame and defines its functionality. 














N(S) N(R) 
[2 IT I 2c 
Code N(R) 
(Te IE +21 
Code Code 


Control Field for I-Frames 

e  [-frames are designed to carry user data from the network layer. In addition, they can include 
flow and error control information (piggybacking). 

e Ifthe first bit of the control field is 0, this means the frame is an I-frame. 

e The next 3 bits, called N(S), define the sequence number of the frame. Note that with 3 bits, 
we can define a sequence number between 0 and 7; but in the extension format, in which the 
control field is 2 bytes, this field is larger. 

e The last 3 bits, called N(R), correspond to the acknowledgment number when piggybacking 
is used. 

© The single bit between N(S) and N(R) is called the P/F bit. The P/F field is a single bit with a 
dual purpose. It has meaning only when it is set (bit = 1) and can mean poll or final. It means 
poll when the frame is sent by a primary station to a secondary. It means final when the 
frame is sent by a secondary to a primary. 

Control Field for S-Frames 

¢ Supervisory frames are used for flow and error control whenever piggybacking is either 
impossible or inappropriate. 

¢ S-frames do not have information fields. 

e Ifthe first 2 bits of the control field is 10, this means the frame is an S-frame. 

e The last 3 bits, called N(R), corresponds to the acknowledgment number (ACK) or negative 
acknowledgment number (NAK) depending on the type of S-frame. 

e The 2 bits called code is used to define the type of S-frame itself. With 2 bits, we can have 


four types of S-frames, as described below: 
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= Receive ready (RR): If the value of the code subfield is 00, it is an RR S-frame. This 
kind of frame acknowledges the receipt of a safe and sound frame or group of frames. In 
this case, the value N(R) field defines the acknowledgment number. 

= Receive not ready (RNR): If the value of the code subfield is 10, it is an RNR S-frame. 
This kind of frame is an RR frame with additional functions. It acknowledges the receipt 
of a frame or group of frames, and it announces that the receiver is busy and cannot 
receive more frames. It acts as a kind of congestion control mechanism by asking the 
sender to slow down. The value of NCR) is the acknowledgment number. 

= Reject (REJ): If the value of the code subfield is 01, it is a REJ S-frame. This is a NAK 
frame, but not like the one used for Selective Repeat ARQ. It is a NAK that can be used 
in Go-Back-N ARQ to improve the efficiency of the process by informing the sender, 
before the sender time expires, that the last frame is lost or damaged. The value of N(R) 
is the negative acknowledgment number. 

= Selective reject (SREJ): If the value of the code subfield is 11, it is an SREJ S-frame. 
This is a NAK frame used in Selective Repeat ARQ. Note that the HDLC Protocol uses 
the term selective reject instead of selective repeat. The value of N(R) is the negative 


acknowledgment number. 
Control Field for V-Frames 


e Unnumbered frames are used to exchange session management and control information 
between connected devices. Unlike S-frames, U-frames contain an information field, but one 
used for system management information, not user data. 

e As with S-frames, however, much of the information carried by U-frames is contained in 
codes included in the control field. 

e U-frame codes are divided into two sections: a 2-bit prefix before the P/F bit and a 3-bit 
suffix after the P/F bit. Together, these two segments (5 bits) can be used to create up to 32 


different types of U-frames. Some of the more common types are shown 
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Code | Command | Response Meaning 
00 001 SNRM Set normal response mode 
11 011 SNRME Set normal response mode, extended 
11 100 | SABM DM Set asynchronous balanced mode or disconnect mode 
11 110 | SABME Set asynchronous balanced mode, extended 
00 000 | UI UL Unnumbered information 
00 110 UA Unnumbered acknowledgment 
00 010 | DISC RD Disconnect or request disconnect 
10 000 | SIM RIM Set initialization mode or request information mode 
00 100 UP Unnumbered poll 
11 001 RSET Reset 
11 101 | XID XID Exchange ID 
10 001 FRMR FRMR Frame reject 




















Example: Connection/Disconnection 
Node A 





U-frame (SABM) 


U-frame (UA) 
Control 


Connection 
establishment 





11100 fH 110 





Data transfer 








U-frame (DISC) 


Connection 
release 











Example: Piggybacking without Error 
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Example: Piggybacking with error 
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3.11 Point-To-Point Protocol 

Today, millions of Internet users who need to connect their home computers to the server of an 
Internet service provider use PPP. The majority of these users have a traditional modem; they are 
connected to the Internet through a telephone line, which provides the services of the physical 
layer. But to control and manage the transfer of data, there is a need for a point-to-point protocol 


at the data link layer. 


PPP provides several services: 

1. PPP defines the format of the frame to be exchanged between devices. 

2. PPP defines how two devices can negotiate the establishment of link and the exchange of data. 
3. PPP defines how network layer data are encapsulated in the data link frame. 

4. PPP defines how two devices can authenticate each other. 

5. PPP provides multiple network layer services supporting a variety of network layer protocols. 
6. PPP provides connections over multiple links. 

7. PPP provides network address configuration. This is particularly useful when a home user 


needs a temporary network address to connect to the Internet. 


Limitations of PPP: 

1. PPP does not provide flow control. 

2. PPP has a very simple mechanism for error control. A CRC field is used to detect errors. If the 
frame is corrupted, it is silently discarded; the upper-layer protocol needs to take care of the 
problem. Lack of error control and sequence numbering may cause a packet to be received out of 
order. 

3. PPP does not provide a sophisticated addressing mechanism to handle frames in a multipoint 


configuration. 


Framing 
Frame Format 


11111111 11000000 



















Protocol 





Payload 






1 byte 1 byte 1 byte 1or2 bytes Variable 2or4bytes 1 byte 
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© Flag: A PPP frame starts and ends with 1-byte flag with the bit pattern 01111110. 

e Address: The address field in this protocol is a constant value and set to 11111111 
(broadcast address). 

© Control: This field is set to the constant value 11000000 

e Protocol: The protocol field defines what is being carried in the data field: either user data or 
other information. 

e Payload field: This field carries either the user data or other information. The data field is a 
sequence of bytes with the default of a maximum of 1500 bytes; but this can be changed 
during negotiation. Because there is no field defining the size of the data field, padding is 
needed if the size is less than the maximum default value or the maximum negotiated value. 

e FCS: The frame check sequence (FCS) is simply a 2-byte or 4-byte standard CRC. 

PPP is a byte-oriented protocol using byte stuffing with the escape byte 01111101. 


Transition Phases 


Failed 
















Carrier 
detected 






Dead Establish 


Carrier Options agreed 
dropped by both sides 


Terminate 


Done 







Failed If authentication 


not needed 






Authentication 
successful 


e Dead: In the dead phase the link is not being used. There is no active carrier (at the physical 





Network layer 
configuration 





layer) and the line is quiet. 

e Establish: When one of the nodes starts the communication, the connection goes into this 
phase. In this phase, options are negotiated between the two parties. If the negotiation is 
successful, the system goes to the authentication phase (if authentication is required) or 
directly to the networking phase. 

e Authenticate: The authentication phase is optional; the two nodes may decide, during the 


establishment phase, not to skip this phase. However, if they decide to proceed with 
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authentication, they send several authentication packets. If the result is successful, the 
connection goes to the networking phase; otherwise, it goes to the termination phase. 

e Network: In the network phase, negotiation for the network layer protocols takes place. PPP 
specifies that two nodes establish a network layer agreement before data at the network layer 
can be exchanged. The reason is that PPP supports multiple protocols at the network layer. If 
a node is running multiple protocols simultaneously at the network layer, the receiving node 
needs to know which protocol will receive the data. 

e Open: In the open phase, data transfer takes place. When a connection reaches this phase, the 
exchange of data packets can be started. The connection remains in this phase until one of the 
endpoints wants to terminate the connection. 

e Terminate: In the termination phase the connection is terminated. Several packets are 


exchanged between the two ends for house cleaning and closing the link. 
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Module — 4 
MEDIA ACCESS CONTROL, ETHERNET, 
WIRELESS LAN 


Data Link Layer is divided into two sub layers: 


Data link layer 








Data link control 











e The upper sub layer is responsible for data link control, and the lower sub layer is responsible 





for resolving access to the shared media. 

e The upper sub layer that is responsible for flow and error control is called the logical link 
control (LLC) layer; 

e The lower sub layer that is mostly responsible for multiple access resolution is called the 
Media Access Control (MAC) layer. 

e When nodes or stations are connected and use a common link, called a multipoint or 
broadcast link, we need a multiple-access protocol to coordinate access to the link. 


e Many formal protocols have been devised to handle access to a shared link. 


protocols 











t— ALOHA Reservation 

H— csma Polling 

H— csma/cD Token passing ae 
L— csmas/ca 


4.1 Random Access 


e In random access or contention methods, no station is superior to another station and none is 
assigned the control over another. No station permits, or does not permit, another station to 


send. 
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e At each instance, a station that has data to send uses a procedure defined by the protocol to 
make a decision on whether or not to send. This decision depends on the state of the medium 
(idle or busy). 

e There is no scheduled time for a station to transmit. Transmission is random among the 
stations. That is why these methods are called random access. 

e Norules specify which station should send next. Stations compete with one another to access 


the medium. That is why these methods are also called contention methods. 


ALOHA 
ALOHA, the earliest random access method, was developed at the University of Hawaii in early 


1970. It was designed for a radio (wireless) LAN, but it can be used on any shared medium. 


Pure ALOHA 
e The original ALOHA protocol is called pure ALOHA. The idea is that each station sends a 


frame whenever it has a frame to send. However, since there is only one channel to share, 


there is the possibility of collision between frames from different stations. 


sion CA remem. 00D coo é 





























Time 
Station 2 sa | EES eNEEStS Frame21] oo Frame22] ss 
ES Time 


Station 3 | 


station 4 











Collision Collision 
duration duration 


Here there are four stations that contend with one another for access to the shared channel. The 
figure shows that each station sends two frames; there are a total of eight frames on the shared 


medium. Some of these frames collide because multiple frames are in contention for the shared 
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channel. Figure shows that only two frames survive: frame 1.1 from station | and frame 3.2 from 

station 3. 

e The pure ALOHA protocol relies on acknowledgments from the receiver. When a station 
sends a frame, it expects the receiver to send an acknowledgment. If the acknowledgment 
does not arrive after a time-out period, the station assumes that the frame (or the 
acknowledgment) has been destroyed and resends the frame. 

e If two or more stations try to resend their frames after the time-out, the frames will collide 
again. Pure ALOHA dictates that when the time-out period passes, each station waits a 
random amount of time before resending its frame. The randomness will help avoid more 
collisions. 

e After a maximum number of retransmission attempts Kmax' a station must give up and try 


later. 


Station has 
a frame to send 








K: Number of attempts 

Tp: Maximum propagation time 

T;,; Average transmission time for a frame 
Tp: Back-off time 





Send the frame 









Choose a random 
number R between 
Oand 2) 


Kmax is 
normally 15 





Vulnerable time: is the length of time in which there is a possibility of collision. 
Station A sends a frame at time 4. Now imagine station B has already sent a frame between ¢ - Tj, 
and ¢. This leads to a collision between the frames from station A and station B. The end of B's 


frame collides with the beginning of A's frame. On the other hand, suppose that station C sends a 
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frame between ¢ and t + 7;.. Here, there is a collision between frames from station A and station 


C. The beginning of C's frame collides with the end of A's frame. 


B's end A’s end 
collides with collides with 
A's beginning C's beginning 


| | 
ge = 2 


> 
t+ Tp Time 


Vulnerable time = 2 x Ty, | 











t 
' 
' 
t 
t 
t 

t 


t-Tr 





The vulnerable time, during which a collision may occur in pure ALOHA, is 2 times the frame 


transmission time. 








Pure ALOHA vulnerable time = 2 x Ty 





Throughput: The throughput for pure ALOHA is S = Ge*° 
Where G is the average number of frames generated by the system during one frame 
transmission time. 


The maximum throughput Smax = 0.184 when G = (1/2). 


Slotted ALOHA 


e Pure ALOHA has a vulnerable time of 2 x 7fr. This is so because there is no rule that defines 
when the station can send. A station may send soon after another station has started or soon 
before another station has finished. 

© Slotted ALOHA was invented to improve the efficiency of pure ALOHA. 

e In slotted ALOHA the time is divided into slots of Ts, s and force the station to send only at 


the beginning of the time slot. 
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Collision Collision 
duration — duration 
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Slot 1 Slot 2 Slot 3 Slot 4 Slot 5 Slot 6 

Because a station is allowed to send only at the beginning of the synchronized time slot, if a 
station misses this moment, it must wait until the beginning of the next time slot. This means that 
the station which started at the beginning of this slot has already finished sending its frame. 
There is still the possibility of collision if two stations try to send at the beginning of the same 
time slot. 


The vulnerable time for slotted ALOHA is one-half that of pure ALOHA. 








Slotted ALOHA vulnerable time = Tfr 
A collides with C 














‘ = 

= ce) 
: > A = 
a a 
i ei 
‘ € aed 
e a & 

> 
t-Tr t t+T pp Time 
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The throughput for slotted ALOHA is S = Ge“ 
The maximum throughput Smax = 0.368 when G=1. 
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Carrier Sense Multiple Access (CSMA) 


To minimize the chance of collision and, therefore, increase the performance, the CSMA method 
was developed. 

The chance of collision can be reduced if a station senses the medium before trying to use it. 
Carrier sense multiple access (CSMA) requires that each station first listen to the medium (or 
check the state of the medium) before sending. In other words, CSMA is based on the principle 


"sense before transmit" or "listen before talk.” 


B starts C starts 
attime t, at time t, 


Area where 
B’s signal exists 






Area where | = 
both signals exist 


Area where 
Y C’s signal exists 
Time Time 








The possibility of collision still exists because of propagation delay; when a station sends a 
frame, it still takes time (although very short) for the first bit to reach every station and for every 
station to sense it. In other words, a station may sense the medium and find it idle, only because 
the first bit sent by another station has not yet been received. 

At time t] station B senses the medium and finds it idle, so it sends a frame. At time t2 (t2> tl) 
station C senses the medium and finds it idle because, at this time, the first bits from station B 
have not reached station C. Station C also sends a frame. The two signals collide and both frames 
are destroyed. 

Vulnerable Time 

The vulnerable time for CSMA is the propagation time Tp . This is the time needed for a signal 
to propagate from one end of the medium to the other. When a station sends a frame, and any 


other station tries to send a frame during this time, a collision will result. 
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propagation time 


Time Time 


Persistence Methods 

There are three persistence methods which decide the action that has to be performed by station 
when the channel is busy or idle. 

1) 1-Persistent: 

The 1-persistent method is simple and straightforward. In this method, after the station finds the 
line idle, it sends its frame immediately (with probability 1). 

This method has the highest chance of collision because two or more stations may find the line 


idle and send their frames immediately. 





Sense 
and transmit 


Continuously sense 





> Time 





Busy 











a. 1-persistent 





Busy 
Idle 


Station can transmit. 





a. 1-persistent 
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2) Nonpersistent: 

In the nonpersistent method, a station that has a frame to send senses the line. If the line is idle, it 
sends immediately. If the line is not idle, it waits a random amount of time and then senses the 
line again. 

The nonpersistent approach reduces the chance of collision because it is unlikely that two or 
more stations will wait the same amount of time and retry to send simultaneously. 

However, this method reduces the efficiency of the network because the medium remains idle 


when there may be stations with frames to send. 





Sense 
and transmit 








b. Nonpersistent 





Busy 
Idle 


Station can transmit. 











b. Nonpersistent 
3) p-Persistent: 
Thep-persistent method is used if the channel has time slots with a slot duration equal to or 
greater than the maximum propagation time. 
It reduces the chance of collision and improves efficiency. In this method, after the station finds 
the line idle it follows these steps: 
1. With probability p, the station sends its frame. 
2. With probability g = 1-p, the station waits for the beginning of the next time slot and checks 
the line again. 
a. If the line is idle, it goes to step 1. 


b. If the line is busy, it acts as though a collision has occurred and uses the backoff procedure. 
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Carrier Sense Multiple Access with Collision Detection (CSMA/CD) 


In Carrier sense multiple access with collision detection (CSMA/CD) a station monitors the 
medium after it sends a frame to see if the transmission was successful. If so, the station is 


finished. If there is a collision, the frame is sent again. 












q 


Transmission 2 Transmission 























time << t;Itime 
cH First bit of Cscol 
A's collision detection and 
detection Collision abortion 
and abortion occurs 
Time Time 








h HV, Assistant Professor, Dept. of ISE, SVIT, Bangalore Page 9 





Module 4: MAC, Ethernet, WLAN 





At time tl, station A has executed its persistence procedure and starts sending the bits of its 
frame. At time #2, station C has not yet sensed the first bit sent by A. Station C executes its 
persistence procedure and starts sending the bits in its frame, which propagate both to the left 
and to the right. The collision occurs sometime after time t2. Station C detects a collision at time 
t3 when it receives the first bit of A's frame. Station C immediately aborts transmission. Station 
A detects collision at time t4 when it receives the first bit of C's frame; it also immediately aborts 


transmission. 













Collision 
occurs 


Transmission} RMI > Transmission 
time t;Itime 


Adetects 
collision and 


aborts 
Cdetects 


collision 
Time and aborts Time 


Minimum Frame Size 

Before sending the last bit of the frame, the sending station must detect a collision, if any, abort 
the transmission. This is so because the station, once the entire frame is sent, does not keep a 
copy of the frame and does not monitor the line for collision detection. Therefore, the frame 
transmission time T;; must be at least two times the maximum propagation time Tp. 

If the two stations involved in a collision are the maximum distance apart, the signal from the 
first takes time Tp to reach the second, and the effect of the collision takes another time Tp to 


reach the first. So the requirement is that the first station must still be transmitting after 2Tp. 


Difference between ALOHA and CSMA/CD 

e The first difference is the addition of the persistence process. 

e The second difference is the frame transmission. In ALOHA, we first transmit the entire 
frame and then wait for an acknowledgment. In CSMA/CD, transmission and collision 
detection is a continuous process. 

e The third difference is the sending of a short jamming signal that enforces the collision in 


case other stations have not yet sensed the collision. 
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Ethernet, WLAN 





Station has Cart > 
K: Number of attempts a frame to send 


Tp: Maximum propagation time 


Tj,: Average transmission time fora frame 
Ty: Backoff time Lo] 
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Choose a random 
number R between 
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Apply one of the 
persistence methods 





(1-persistent, nonpersistent, 
or p-persistent) 




















Kinax is eo Senda Yes 
normally 15 











Energy Level 


The level of energy in a channel can have three values: zero, normal, and abnormal. At the zero 


level, the channel is idle. At the normal level, a station has successfully captured the channel and 


is sending its frame. At the abnormal level, there is a collision and the level of the energy is 


twice the normal level. A station that has a frame to send or is sending a frame needs to monitor 


the energy level to determine if the channel is idle, busy, or in collision mode. 


























A Energy 
Collision 
Frame transmission Frame transmission Time 
Idle 
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Throughput 

The maximum throughput occurs at a different value of G and is based on the persistence method 
and the value of p in the p-persistent approach. For 1-persistent method the maximum throughput 
is around 50 percent when G =1. For nonpersistent method, the maximum throughput can go up 


to 90 percent when G is between 3 and 8. 


Carrier Sense Multiple Access with Collision Avoidance(CSMA/CA) 


Collisions are avoided through the use of CSMA/CA's three strategies: the interframe space, the 


contention window, and acknowledgments. 


Size: 


rouse binary exponential 


idle 
Continuously sense 


| 






Busy Contention window send frame Time-out. Time 


Interframe Space (IFS) 

Collisions are avoided by deferring transmission even if the channel is found idle. When an idle 
channel is found, the station does not send immediately. It waits for a period of time called the 
interframe space or IFS. Even though the channel may appear idle when it is sensed, a distant 
station may have already started transmitting. The distant station's signal has not yet reached this 
station. The IFS time allows the front of the transmitted signal by the distant station to reach this 
station. If after the IFS time the channel is still idle, the station can send, but it still needs to wait 
a time equal to the contention time. The IFS variable can also be used to prioritize stations or 


frame types. For example, a station that is assigned a shorter IFS has a higher priority. 


Contention Window 

The contention window is an amount of time divided into slots. A station that is ready to send 
chooses a random number of slots as its wait time. The number of slots in the window changes 
according to the binary exponential back-off strategy. This means that it is set to one slot the first 


time and then doubles each time the station cannot detect an idle channel after the IFS time. The 
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station needs to sense the channel after each time slot. However, if the station finds the channel 
busy, it does not restart the process; it just stops the timer and restarts it when the channel is 


sensed as idle. This gives priority to the station with the longest waiting time. 


Acknowledgment 
With all these precautions, there still may be a collision resulting in destroyed data. In addition, 
the data may be corrupted during the transmission. The positive acknowledgment and the time- 


out timer can help guarantee that the receiver has received the frame. 
= 
No 
Yes 
No 
Yes 


Choose a random 
number R between 
Oand 2%-1 











Contention window 
size is2"-1. 





After each slot, if idle, 
continue; if busy, halt and 
continue when idle. 





Send fra 








it 
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4.2 Controlled Access 


In controlled access, the stations consult one another to find which station has the right to send. 


A station cannot send unless it has been authorized by other stations. 


Reservation 

e In the reservation method, a station needs to make a reservation before sending data. 

e Time is divided into intervals. In each interval, a reservation frame precedes the data frames 
sent in that interval. 

e If there are N stations in the system, there are exactly N reservation minislots in the 
reservation frame. Each minislot belongs to a station. When a station needs to send a data 
frame, it makes a reservation in its own minislot. The stations that have made reservations 


can send their data frames after the reservation frame. 








an 2345 12345 
ea Data Data 
fool O}0/0/0 ea 1 0/0)0/0 station 4 station 3 Bee 
Reservation 
frame 





e Here in the first interval, only stations 1, 3, and 4 have made reservations. In the second 


interval, only station 1 has made a reservation. 


Polling 

e Polling works with topologies in which one device is designated as a primary station and the 
other devices are secondary stations. 

e All data exchanges must be made through the primary device even when the ultimate 
destination is a secondary device. 

¢ The primary device controls the link; the secondary devices follow its instructions. It is up to 
the primary device to determine which device is allowed to use the channel at a given time. 


The primary device, therefore, is always the initiator of a session 
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e Ifthe primary wants to receive data, it asks the secondaries if they have anything to send; this 
is called poll function. 
e Ifthe primary wants to send data, it tells the secondary to get ready to receive; this is called 


select function. 


Select 

e The se/ect function is used whenever the primary device has something to send. 

e If it has something to send, the primary device sends it. 

e The primary must alert the secondary to the upcoming transmission and wait for an 
acknowledgment of the secondary's ready status. 

e Before sending data, the primary creates and transmits a select (SEL) frame, one field of 


which includes the address of the intended secondary. 


Poll 

© The poll function is used by the primary device to solicit transmissions from the secondary 
devices. 

e When the primary is ready to receive data, it must ask (poll) each device in turn if it has 
anything to send. 

e When the first secondary is approached, it responds either with a NAK frame if it has nothing 
to send or with data if it does. 

e If the response is negative (a NAK frame), then the primary polls the next secondary in the 
same manner until it finds one with data to send. When the response is positive (a data 
frame), the primary reads the frame and returns an acknowledgment (ACK frame), verifying 


its receipt. 
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Token Passing 

e In the token-passing method, the stations in a network are organized in a logical ring. 

¢ For each station, there is a predecessor and a successor. The predecessor is the station which 
is logically before the station in the ring; the successor is the station which is after the station 
in the ring. The current station is the one that is accessing the channel now. 

e The right to this access has been passed from the predecessor to the current station. The right 
will be passed to the successor when the current station has no more data to send. 

e In this method, a special packet called a token circulates through the ring. The possession of 
the token gives the station the right to access the channel and send its data. When a station 
has some data to send, it waits until it receives the token from its predecessor. It then holds 
the token and sends its data. When the station has no more data to send, it releases the token, 
passing it to the next logical station in the ring. 

e The station cannot send data until it receives the token again in the next round. 

¢ In this process, when a station receives the token and has no data to send, it just passes the 
data to the next station. 

e Token management is needed for this access method. 
= Stations must be limited in the time they can have possession of the token. 
= The token must be monitored to ensure it has not been lost or destroyed. 
= Another function of token management is to assign priorities to the stations and to the 

types of data being transmitted. And finally, token management is needed to make low- 
priority stations release the token to high priority stations. 

e Ina token-passing network, stations do not have to be physically connected in a ring; the ring 
can be a logical one. 

e In the physical ring topology, when a station sends the token to its successor, the token 
cannot be seen by other stations; the successor is the next one in line. This means that the 
token does not have to have the address of the next successor. The problem with this 
topology is that if one of the links-the medium between two adjacent stations fails, the whole 
system fails. 

e The dual ring topology uses a second (auxiliary) ring which operates in the reverse direction 


compared with the main ring. The second ring is for emergencies only. If one of the links in 
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the main ring fails, the system automatically combines the two rings to form a temporary 
ring. After the failed link is restored, the auxiliary ring becomes idle again. 

In the bus ring topology, also called a token bus, the stations are connected to a single cable 
called a bus. They, however, make a logical ring, because each station knows the address of 
its successor. When a station has finished sending its data, it releases the token and inserts the 
address of its successor in the token. Only the station with the address matching the 
destination address of the token gets the token to access the shared media. 

In a star ring topology, the physical topology is a star. There is a hub, however, that acts as 
the connector. The wiring inside the hub makes the ring; the stations are connected to this 
ring through the two wire connections. This topology makes the network less prone to failure 
because if a link goes down, it will be bypassed by the hub and the rest of the stations can 


operate. Also adding and removing stations from the ring is easier. 
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4.3 Channelization 


Channelization is a multiple-access method in which the available bandwidth of a link is shared 


in time, frequency, or through code, between different stations. 


Frequency-Division Multiple Access (FDMA) 





In frequency-division multiple access (FDMA), the available bandwidth is divided into 
frequency bands. 
Each station is allocated a band to send its data. In other words, each band is reserved for a 


specific station, and it belongs to the station all the time. 
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e Each station also uses a bandpass filter to confine the transmitter frequencies. 
e To prevent station interferences, the allocated bands are separated from one another by small 


guard bands. 


Data Data 












Common 
channel 








Silent Data 


Time-Division Multiple Access (TDMA) 





e In time-division multiple access (TDMA), the stations share the bandwidth of the channel in 
time. 

e Each station is allocated a time slot during which it can send data. Each station transmits its 
data in is assigned time slot. 

e The main problem with TDMA lies in achieving synchronization between the different 
stations. 

e Each station needs to know the beginning of its slot and the location of its slot. This may be 
difficult because of propagation delays introduced in the system if the stations are spread 
over a large area. 

e To compensate for the delays, we can insert guard times. Synchronization is normally 


accomplished by having some synchronization bits at the beginning of each slot. 
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Data Data 























Silent Data 


Cod ion Multiple Access (CDMA) 


CDMA differs from FDMA because only one channel occupies the entire bandwidth of the link. 


It differs from TDMA because all stations can send data simultaneously; there is no timesharing. 


Let us assume we have four stations 1, 2, 3, and 4 connected to the same channel. The data from 
station 1 are d/, from station 2 are d2, and so on. The code assigned to the first station is cl, to 
the second is c2, and so on. We assume that the assigned codes have two properties. 

1. If we multiply each code by another, we get 0. 

2. If we multiply each code by itself, we get 4 (the number of stations). 

Station | multiplies its data by its code to get dl.cl Station 2 multiplies its data by its code to get 
d2.c2 and so on. The data that go on the channel are the sum of all these terms. 

Any station that wants to receive data from one of the other three multiplies the data on the 
channel by the code of the sender. For example, suppose stations | and 2 are talking to each 
other. Station 2 wants to hear what station | is saying. It multiplies the data on the channel by cl 
the code of station 1. 

Because (cl . cl) is 4, but (c2 . cl), (c3 . cl), and (c4 . cl) are all Os, station 2 divides the result by 
4 to get the data from station 1. 

data =(d1.C1 + d2.C2 +d3.C3 + d4.c4) . Cl= 4x dl 
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a d2 





7 2 
diy d2-c2 








Common 
channel 





CDMA is based on coding theory. Each station is assigned a code, which is a sequence of 


numbers called chips. 


Gj a Gs. con 


They are called orthogonal sequences and have the following properties: 
1. Each sequence is made of N elements, where N is the number of stations. 
2. If we multiply a sequence by a number, every element in the sequence is multiplied by that 
element. This is called multiplication of a sequence by a scalar. For example, 
2. [4141-1 -1] = [42 42 -2-2] 

3. If we multiply two equal sequences, element by element, and add the results, we get N, where 
N is the number of elements in the each sequence. This is called the inner product of two equal 
sequences. For example, 

[HI 41-11]: [+1 41-1 -1]= 141414124 
4. If we multiply two different sequences, element by element, and add the results, we get 0. This 
is called inner product of two different sequences. For example, 

[4141 -1-1]* [+141 +141]=14+1-1-1=0 
5. Adding two sequences means adding the corresponding elements. The result is another 
sequence. For example, 


[41 41-1 1} [41 41 41 H1]=[42 4200) 
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Data Representation 
Data bit O———>- -1 | Data bit > +1 | Silence ———> 0 | 


Encoding and Decoding 


























Example: 
Bito Bito 
A 1 
q Gq 
G1 +1 41 417 Et sh 2) 
dy-cy da. cy 
y 13-0 ti+l 14+ y 
Common 
channel 
fo 0 Oo oO} G1 -1 -141 
G3 d3-cs dy Ca Gy <j 
EA 1 21 21) t eet Sie] 
0 +1 
Silent Bit1 


Imagine station 3, which we said is silent, is listening to station 2. Station 3 multiplies the total 
data on the channel by the code for station 2, which is [+1 -1 +1-1], to get 
[1-1-3 +1]- [41-1 41-1] =-4/4 =1 > bit 1 


Signal Level 


Bit O —>| 


[1-1-1 -1] 
a .. 


fj} 
Silent —>f3 f-—> 10 0 0 9 > 
(at 


Bit O —>| 





Time 


ae — 
Time 


Data on the channel 


Bit 1 —>] 





Time 
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Decoding of the composite signal for one in CDMA 


Data on the channel 
Time 


Station 2's code 
FAT =1 +1 -1] | T ~ 
Time 


Inner product result 








Time 





Y 


Summing the values 
Time 


-4 —> -4/4 —> -1 —> Bito 
Sequence Generation 
To generate chip sequences, Walsh table is used, which is a two-dimensional table with an equal 


number of rows and columns. 





Wn Wn 
W,= [+1] Won = 





a. Two basic rules 


i 


wv, - [2 aa 
41 -1 -1 +1 





Wi 

















b. Generation of W,, Wj,and W, 


In the Walsh table, each row is a sequence of chips. 


The number of sequences in a Walsh table needs to beN = 2m. 
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Ethernet 
IEEE Standards 


The relationship of the 802 Standard to the traditional OSI model is shown in the figure. The 
IEEE has subdivided the data link layer into two sublayers: logical link control (LLC) and media 
access control (MAC). IEEE has also created several physical layer standards for different LAN 
protocols. 


LLC: Logical link control 
MAC: Media access control 





Upper layers Upper layers 
































Data link layer 
Ethernet Token Ring Token Bus 
MAC MAC MAC ra 
~~ = Ethernet e 
Physical layer physical layers aes Ris) ee aa 
geveran, physical layer physical layer 
OSI or Internet model IEEE Standard 


Data Link Layer 
The data link layer in the IEEE standard is divided into two sublayers: LLC and MAC. 


Logical Link Control (LL© - flow control, error control, part of framing. Provides one single 
data link control for all IEEE LANs 
Framing LLC defines a protocol data unit (PDU) that is somewhat similar to that of HDLC. 

e The header contains a control field like the one in HDLC; this field is used for flow and 
error control. 

e The two other header fields define the upper-layer protocol at the source and destination 
that uses LLC. These fields are called the destination service access point (DSAP) and 
the source service access point (SSAP). 

e In other words, a frame defined in HDLC is divided into a PDU at the LLC sublayer and 
a frame at the MAC sublayer, as shown in figure. 
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FCS: Frame Check Sequence: Contains CRC-32 Error Detection sequence 

















DSAP: Destination service access point LLC PDU 
SSAP: Source service access point alee - 
SanSalcontral Upper-layer 
>) AIA data 
P| P 
Address | Control Dares yet : 
lata H ' 
HDLC frame MAC 
header MAC payload 
MAC frame 


Need for LLC The purpose of the LLC is to provide flow and error control for the upper-layer 
protocols that actually demand these services. 
For example, if a LAN or several LANs are used in an isolated system, LLC may be needed to 


provide flow and error control for the application layer protocols. 


Media Access Control (MAC) - Defines the specific access method for each type of LAN 





(Ethernet-CSMA/CD, Token Ring and Token Bus-Token Passing). Provides part of framing 


function. 


Physical Layer 
e The physical layer is dependent on the implementation and type of physical media used. 
e IEEE defines detailed specifications for each LAN implementation. 
e For example, although there is only one MAC sublayer for Standard Ethernet, there is a 


different physical layer specification for each Ethernet implementations. 


4.4 STANDARD ETHERNET 


The original Ethernet was created in 1976 at Xerox's Palo Alto Research Center (PARC). Since 
then, it has gone through four generations: Standard Ethernet (10 t Mbps), Fast Ethernet (100 
Mbps), Gigabit Ethernet (1 Gbps), and Ten-Gigabit Ethernet (10 Gbps), as shown in the figure: 
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Ethernet 
evolution 


[ [ [ 1 


Standard Fast Gigabit Ten-Gigabit 
Ethernet Ethernet Ethernet Ethernet 


10 Mbps 100 Mbps 1 Gbps 10 Gbps 








MAC Sublayer 


In Standard Ethernet, the MAC sublayer governs the operation of the access method. It also 


frames data received from the upper layer and passes them to the physical layer. 


Frame Format 
Ethernet does not provide any mechanism for acknowledging received frames, making it what is 
known as an unreliable medium. Acknowledgments must be implemented at the higher layers. 


The format of the MAC frame is shown in the figure. 


Preamble: 56 bits of alternating 1s and Os. 
SFD: Start frame delimiter, flag (10101011) 


rer | | Destination | Source | Length | 
Sddidss address | ortype Data and padding} CRC 
7bytes 1byte| 6 bytes 6bytes 2 bytes 4 bytes 


Physical layer 
header 





e Preamble. The first field of the 802.3 frame contains 7 bytes (56 bits) of alternating 0s and 
Is that alerts the receiving system to the coming frame and enables it to synchronize its input 
timing. 

e Start frame delimiter (SFD). The second field (1 byte: 10101011) signals the beginning of 
the frame. The SFD warns the station or stations that this is the last chance for 
synchronization. The last 2 bits is 11 and alerts the receiver that the next field is the 


destination address. 
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e Destination address (DA). The DA field is 6 bytes and contains the physical address of the 
destination station or stations to receive the packet. 

e Source address (SA). The SA field is also 6 bytes and contains the physical address of the 
sender of the packet. 

e Length or type. Define upper layer protocol, length: the number of bytes in data field Data. 
It is a minimum of 46 and a maximum of 1500 bytes. 


e CRC. The last field contains error detection information, in this case a CRC-32. 


Frame Length 


Ethernet has imposed restrictions on both the minimum and maximum lengths of a frame, as 


shown in figure. 


Minimum payload length: 46 bytes 
Maximum payload length: 1500 bytes 








Destination Source Length 7 
address address PDU Weieins eacleling re 
6 bytes 6 bytes 2 bytes 4 bytes 


Minimum frame length: 512 bits or 64 bytes 





Maximum frame length: 12,144 bits or 1518 bytes 


Addressing 
e Each station on an Ethernet network (such as a PC, workstation, or printer) has its own 
network interface card (NIC). 
e The NIC fits inside the station and provides the station with a 6-byte physical address. 
e As shown in the figure, the Ethernet address is 6 bytes (48 bits), normally written in 


hexadecimal notation, with a colon between the bytes. 


06:01 :02:01:2C:4B 
L J 


6 bytes = 12 hex digits = 48 bits 
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Unicast, Multicast, and Broadcast Addresses: 
e A source address is always a unicast address - the frame comes from only one station. 


e The destination address, however, can be unicast, multicast, or broadcast. 


The following figure shows how to distinguish a unicast address from a multicast address. 
If the least significant bit of the first byte in a destination address is 0, the address is unicast; 
otherwise, it is multicast. 
Unicast: 0; multicast: 1 
Byte 1 Byte 2 Byte 6 
e A unicast destination address defines only one recipient; the relationship between the sender 
and the receiver is one-to-one. 
e A multicast destination address defines a group of addresses; the relationship between the 
sender and the receivers is one4o-many. 
e The broadcast address is a special case of the multicast address; the recipients are all the 


stations on the LAN. A broadcast destination address is forty-eight Is. 


Access Method: CSMA/CD 
Standard Ethernet uses 1-persistent COMA/CD 
Slot Time In an Ethernet network, the round-trip time required for a frame to travel from one 
end of a maximum-length network to the other plus the time needed to send the jam sequence is 
called the slot time. 

Slot time = round-trip time + time required to send the jam sequence 
The slot time in Ethernet is defined in bits. It is the time required for a station to send 512 bits. 
This means that the actual slot time depends on the data rate; for traditional 10-Mbps Ethernet it 
is 51.2ps. 
Slot Time and Collision The choice of a 512-bit slot time was not accidental. It was chosen to 
allow the proper functioning of CSMA/CD. 
Slot Time and Maximum Network Length There is a relationship between the slot time and 
the maximum length of the network (collision domain). It is dependent on the propagation speed 


of the signal in the particular medium. 
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In most transmission media, the signal propagates at 2x 10° m/s (two-thirds of the rate for 


propagation in air). For traditional Ethernet, we calculate 


MarLang = PropapionS peed SO 


MaiLength = (2 108) x (51.2x 10/2) =5120'm 
Of course, we need to consider the delay times in repeaters and interfaces, and the time required 
to send the jam sequence. These reduce the maximum-length of a traditional Ethernet network to 


2500 m, just 48 percent of the theoretical calculation. 


MaxLength = 2500 m 


Physical Layer 


The Standard Ethernet defines several physical layer implementations; four of the most common, 
are shown in figure. 


Standard Ethernet 
common 





implementations 








I I I ] 
10Base5 | 10Base2 | 10Base-T | 10Base-F | 


Bus, Bus, Star, Star, 
thick coaxial thin coaxial UTP fiber 


Encoding and Decoding 
At the sender, data are converted to a digital signal using the Manchester scheme; at the 
receiver, the received signal is interpreted as Manchester and decoded into data. 


The figure shows the encoding scheme for Standard Ethernet. 





10 Mbps data 10 Mbps data 


Manchester Manchester 
encoder decoder 


Station 





Twisted pairs or fibers 
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10Base5: Thick Ethernet 


10 Mbps 500 m 











=| 
ico 
— 
— 















Transceiver cable 
maximum 50m 







Cable 
end 


Baseband Cable 
(digital) end 








Transceiver Thick coaxial cable 
maximum 500 m 


¢ Known as Thicknet 

© Thick coaxial cable 

e Uses bus topology with external transceiver: transceiver is responsible for transmitting, 
receiving, and detecting collisions. 


e¢ Max length of each segment 500m 


10Base2: Thin Ethernet 

e Knows as Thin Ethernet 

e Uses bus topology with thin and more flexible cable 

e Uses internal Transceiver — part of NIC(network interface card) which is installed inside 
the station. 

e Max length of each segment 185m due to the high level of attenuation in thin coaxial 
cable. 

e This implementation is more cost effective than 10Base5 because thin coaxial cable is 


less expensive than thick coaxial and the tee connections are much cheaper than taps. 


Cable 
end 


10 Mbps 185m 





Baseband 
(digital) 














Thin coaxial cable, 
maximum 185 m 
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10Base- T: Twisted-Pair Ethernet 
e Knows as twisted pair Ethernet 
e Uses physical star topology 
e Stations connected to hub via two pairs of twisted cable. 
e Max length 100m 


© Collision here happens in the hub. 











= 
ooo) 
10Bas' — 
ee 
10 Mbps Twisted pair (a Te] [es] 
| I 








Baseband Two pairs of 
(digital) UTP cable 


10Base-T hub 
10Base-F: Fiber Ethernet 
e 10Base-F uses a star topology to connect stations to a hub. 


e The stations are connected to the hub using two fiber-optic cables. 








10Base-F 
10 Mbps Fiber Two fiber-optic 
Baseband a 
(digital [oo] [oo] 





10Base-F hub 


Summary of Standard Ethernet implementations 














Characteristics L0Base5 10Base2 10Base-T 10Base-F 
Media Thick Thin 2 UTP 2 Fiber 
coaxialcable | coaxial cable 
Maximum length 500 m 185 m 100 m 2000 m 
Line encoding Manchester Manchester Manchester Manchester 
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4.5 CHANGES IN THE STANDARD 


Bridged Ethernet 
Bridges have two effects on an Ethernet LAN: They raise the bandwidth and they separate 


collision domains. 


Raising the Bandwidth 

e In an unabridged Ethernet network, the total capacity (10 Mbps) is shared among all stations 
with a frame to send; the stations share the bandwidth of the network. 

e If only one station has frames to send, it benefits from the total capacity (10 Mbps). But if 
more than one station needs to use the network, the capacity is shared. 

e A bridge divides the network into two or more networks. Bandwidth-wise, each network is 
independent. 

For example, in the figure below, a network with 12 stations is divided into two networks, each 

with 6 stations. Now each network has a capacity of 10 Mbps. The 10-Mbps capacity in each 

segment is now shared between 6 stations (actually 7 because the bridge acts as a station in each 

segment), not 12 stations. 

e Ina network with a heavy load, each station theoretically is offered 10/6 Mbps instead of 


10/12 Mbps, assuming that the traffic is not going through the bridge. 























b. With bridging 


Separating Collision Domains 
e Another advantage of a bridge is the separation of the collision domain. 


e The figure below shows the collision domains for an unbridged and a bridged network. 
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e You can see that the collision domain becomes much smaller and the probability of collision 
is reduced tremendously. Without bridging, 12 stations contend for access to the medium; 


with bridging only 3 stations contend for access to the medium. 















































‘a. Without bridging 
































b. With bridging 











Switched Ethernet 
e The idea of a bridged LAN can be extended to a switched LAN. 
e In other words, if we can have a multiple-port bridge, It has an N-port switch, In this way, the 


bandwidth is shared only between the station and the switch (5 Mbps each). 
Switch 








Full-Duplex Ethernet 

e One of the limitations of 10Base5 and 10Base2 is that communication is half-duplex 
(10Base-T is always full-duplex); a station can either send or receive, but may not do both at 
the same time. 

e The next step in the evolution was to move from switched Ethernet to full-duplex switched 


Ethernet. The full-duplex mode increases the capacity of each domain from 10 to 20 Mbps. 
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e The figure below shows a switched Ethernet in full-duplex mode. Note that instead of using 
one link between the station and the switch, the configuration uses two links: one to transmit 


and one to receive. 





go Transmit Transmit a! 
— eee 2g eee: 

==) << —_ 
Sammi Receive Receive mmm. 











4.6 FAST ETHERNET 


The goals of Fast Ethernet can be summarized as follows: 
1. Upgrade the data rate to 100 Mbps. 
2. Make it compatible with Standard Ethernet. 
3. Keep the same 48-bit address. 
4. Keep the same frame format. 
5. Keep the same minimum and maximum frame lengths. 


MAC Sublayer 

e For the star topology, there are two choices, as we saw before: half duplex and full duplex. 

¢ In the half-duplex approach, the stations are connected via a hub; in the full-duplex approach, 
the connection is made via a switch with buffers at each port. 

e The access method is the same (CSMA/CD) for the half-duplex approach; 

e For full-duplex Fast Ethernet, there is no need for CSMA/CD. 


Autonegotiation 
A new feature added to Fast Ethernet is called autonegotiation. It allows a station or a hub a 
range of capabilities. Autonegotiation allows two devices to negotiate the mode or data rate of 


operation. 
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Physical Layer 

Topology 

Fast Ethernet is designed to connect two or more stations together. 

e If there are only two stations, they can be connected point-to-point. 

e Three or more stations need to be connected in a star topology with a hub or a switch at the 


center 

















2.2 


a. Point-to-point 








Implemetation 


Common Fast Ethernet 
mplementations 


100Base-TX | 100Base-FX | 100Base-T4 | 











Two wires Two wires Four wires 
category 5 UTP fiber category 3 UTP 
100Base-TX: 


e Uses two pairs of twisted-pair cable (either category 5 UTP or STP). 

e MLT-3 scheme was selected since it has good bandwidth performance. 

e 4B/5B block coding is used to provide bit synchronization. 

e This creates a data rate of 125 Mbps, which is fed into MLT-3 for encoding. 


100Base-FX. 
e Uses two pairs of fiber-optic cables. Optical fiber can easily handle high bandwidth 


requirements by using simple encoding schemes. 
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NRZ-I encoding scheme for this implementation. However, NRZ-I has a_ bit 
synchronization problem for long sequences of 0s (or 1s, based on the encoding). 

To overcome this problem, the designers used 4B/5B block encoding. 

The block encoding increases the bit rate from 100 to 125 Mbps, which can easily be 


handled by fiber-optic cable. 


100Base-T4, 


Was designed to use category 3 or higher UTP. The implementation uses four pairs of 
UTP for transmitting 100 Mbps. 

Encoding/decoding in 100Base-T4 is more complicated. As this implementation uses 
category 3 UTP, each twisted-pair cannot easily handle more than 25 Mbaud. 

In this design, one pair switches between sending and receiving. Three pairs of UTP 
category 3, however, can handle only 75 Mbaud (25 Mbaud) each. We need to use an 
encoding scheme that converts 100 Mbps to a 75 Mbaud signal. 

8B/6T satisfies this requirement. In 8B/6T, eight data elements are encoded as six signal 


elements. This means that 100 Mbps uses only (6/8) x 100 Mbps, or 75 Mbaud. 


Summary of Fast Ethernet implementations 



































Characteristics 100Base-TX 100Base-FX 100Base-T4 
Media Cat 5 UTP or STP Fiber Cat 4 UTP 
Number of wires 2 2 4 
Maximum length 100 m 100 m 100 m 
Block encoding 4B/5B 4B/5B 
Line encoding MLT-3 NRZ-I 8B/6T 

4.7 GIGABIT ETHERNET 


The goals of the Gigabit Ethernet design can be summarized as follows: 


1. Upgrade the data rate to 1 Gbps. 


2. Make it compatible with Standard or Fast Ethernet. 
3. Use the same 48-bit address. 
4. 


Use the same frame format. 
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5. Keep the same minimum and maximum frame lengths. 


6. To support autonegotiation as defined in Fast Ethernet. 


MAC Sublayer 
Gigabit Ethernet has two distinctive approaches for medium access: half-duplex and full-duplex. 


Almost all implementations of Gigabit Ethernet follow the full-duplex approach. 


Full-Duplex Mode 
In full-duplex mode, there is a central switch connected to all computers or other switches. In 
this mode, each switch has buffers for each input port in which data are stored until they are 


transmitted. There is no collision in this mode. This means that CSMA/CD is not used. 


Half-Duplex Mode 
Gigabit Ethernet can also be used in half-duplex mode, although it is rare. In this case, a switch 
can be replaced by a hub, which acts as the common cable in which a collision might occur. The 


half-duplex approach uses CSMA/CD. 


Physical Layer 
Topology 
e If there are only two stations, they can be connected point-to-point. 
e Three or more stations need to be connected in a star topology with a hub or a switch at 


the center. 
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i Hierarchy of stars 


Implementation 


Gigabit Ethernet 
implementations 








1000Base-SX | 1000Base-LX | 1000Base-CX | 1000Base-T | 














Two-wire Two-wire Two-wire Four-wire 
short-wave fiber long-wave fiber copper (STP) UTP 
Summary of Ten-Gigabit Ethernet implementations 
Characteristics 10GBase-S 10GBase-L 10GBase-E 
Media Short-wave Long-wave Extended 
850-nm 1310-nm 1550-mm 
multimode single mode single mode 
Maximum length 300m 10km 40 km 




















4.8 Ten-Gigabit Ethernet 

The goals of the Ten-Gigabit Ethernet design can be summarized as follows: 
Upgrade the data rate to 10 Gbps. 

Make it compatible with Standard, Fast, and Gigabit Ethernet. 

Use the same 48-bit address. 

Use the same frame format. 


Keep the same minimum and maximum frame lengths. 


Nw se WY SE 


Allow the interconnection of existing LANs into a metropolitan area network (MAN) ora 
wide area network (WAN). 
7. Make Ethernet compatible with technologies such as Frame Relay and ATM. 
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MAC Sublayer 
Ten-Gigabit Ethernet operates only in full duplex mode which means there is no need for 


contention; CSMA/CD is not used in Ten-Gigabit Ethernet. 
Physical Layer 
The physical layer in Ten-Gigabit Ethernet is designed for using fiber-optic cable over long 


distances. Three implementations are the most common: 10GBase-S, 10GBase-L, and 10GBase-E. 


Summary of Ten-Gigabit Ethernet implementations 





























Characteristics 10GBase-S 10GBase-L 10GBase-E 
Media Short-wave Long-wave Extended 
850-nm 1310-nm 1550-mm 
multimode single mode single mode 
Maximum length 300m 10km 40 km 
4.9 IEEE 802.11 


IEEE has defined the specifications for a wireless LAN, called IEEE 802.11, which covers the 
physical and data link layers. 


Architecture 
The standard defines two kinds of services: the Basic Service Set (BSS) and the Extended 


Service Set (ESS). 


Basic Service Set 
e IEEE 802.11 defines the basic service set (BSS) as the building block of a wireless LAN. 
e A basic service set is made of stationary or mobile wireless stations and an optional central 


base station, known as the access point (AP). 
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BSS: Basic service set 
AP: Access point 






Station 


=) ewe 6h 

ee maa == 
Station Station Station 
Ad hoc network (BSS without an AP) Infrastructure (BSS with an AP) _ 


The BSS without an AP is a stand-alone network and cannot send data to other BSSs. It is 
called an ad hoc architecture. In this architecture, stations can form a network without the 
need of an AP; they can locate one another and agree to be part of a BSS. 


A BSS with an AP is sometimes referred to as an infrastructure network. 


Extended Service Set 


An extended service set (ESS) is made up of two or more BSSs with APs. 

In this case, the BSSs are connected through a distribution system, which is usually a wired 
LAN. 

The distribution system connects the APs in the BSSs. IEEE 802.11 does not restrict the 
distribution system; it can be any IEEE LAN such as an Ethernet. 

The extended service set uses two types of stations: mobile and stationary. The mobile 
stations are normal stations inside a BSS. The stationary stations are AP stations that are part 


of a wired LAN. 


ESS: Extended service set 
BSS: Basic service set 
AP: Access point 












Station Types 
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e IEEE 802.11 defines three types of stations based on their mobility in a wireless LAN: no- 
transition, BSS-transition, and ESS-transition mobility. 

e A station with no-transition mobility is either stationary (not moving) or moving only inside 
aBSS. 

e A station with BSS-transition mobility can move from one BSS to another, but the movement 
is confined inside one ESS. 

e A station with ESS-transition mobility can move from one ESS to another. However, IEEE 
802.11 does not guarantee that communication is continuous during the move. 

MAC Sublayer 

TEEE 802.11 defines two MAC sublayers: the distributed coordination function (DCF) and point 


coordination function (PCF). 





Contention-free 




















Data link SEIvIcS Contention 
layer service 
Point coordination function (PCF) 
MAC 
sublayer 
Distributed coordination function (DCF) 
Physical 802.11 802.11 802.11 802.11a | 802.11a | 802.11g 
layer FHSS Dsss Infrared Dsss OFDM Dsss 


























Distributed Coordination Function 





One of the two protocols defined by IEEE at the MAC sublayer is called the distributed 

coordination function (DCF). DCF uses CSMA/CA as the access method. Wireless LANs cannot 

implement CSMA/CD for three reasons: 

1. For collision detection a station must be able to send data and receive collision signals at the 
same time. This can mean costly stations and increased bandwidth requirements. 

2. Collision may not be detected because of the hidden station problem. 


3. The distance between stations can be great. Signal fading 


Process Flowchart The figure shows the process flowchart for CSMA/CA as used in wireless 
LANs. 
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Set back-off 
to zero 
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Frame Exchange Time Line The figure shows the exchange of data and control frames in time. 


Source Destination All other stations 





SIFS 











Y 


Time Time Time Time 
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1. Before sending a frame, the source station senses the medium by checking the energy level at 
the carrier frequency. 

a. The channel uses a persistence strategy with back-off until the channel is idle. 

b. After the station is found to be idle, the station waits for a period of time called the 
distributed interframe space (DIFS); then the station sends a control frame called the 
request to send (RTS). 

2. After receiving the RTS and waiting a period of time called the short interframe space 
(SIFS), the destination station sends a control frame, called the clear to send (CTS), to the 
source station. This control frame indicates that the destination station is ready to receive 
data. 

3. The source station sends data after waiting an amount of time equal to SIFS. 

4, The destination station, after waiting an amount of time equal to SIFS, sends an 
acknowledgment to show that the frame has been received. Acknowledgment is needed in 
this protocol because the station does not have any means to check for the successful arrival 
of its data at the destination. On the other hand, the lack of collision in CSMA/CD is a kind 


of indication to the soume that data have arrived. 


Network Allocation Vector: 

e When a station sends an RTS frame, it includes the duration of time that it needs to occupy 
the channel. 

e The stations that are affected by this transmission create a timer called a network allocation 
vector (NAV) that shows how much time must pass before these stations are allowed to 
check the channel for idleness. 

e Each time a station accesses the system and sends an RTS frame, other stations start their 
NAV. In other words, each station, before sensing the physical medium to see if it is idle, 


first checks its NAV to see if it has expired. 


Collision during Handshaking 
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What happens if there is collision during the time when RTS or CTS control frames are in 
transition, often called the handshaking period? Two or more stations may try to send RTS 
frames at the same time. 

These control frames may collide. However, because there is no mechanism for collision 
detection, the sender assumes there has been a collision if it has not received a CTS frame from 


the receiver. The back-off strategy is employed, and the sender tries again. 


Point Coordination Function (PCF) 

e The point coordination function (PCF) is an optional access method that can be implemented 
in an infrastructure network (not in an ad hoc network). 

e It is implemented on top of the DCF and is used mostly for time-sensitive transmission. 

e PCF has a centralized, contention-free polling access method. 

e The AP performs polling for stations that are capable of being polled. The stations are polled 
one after another, sending any data they have to the AP. 

e To give priority to PCF over DCF, another set of interframe spaces has been defined: PIFS 
and SIFS. The SIFS is the same as that in DCF, but the PIFS (PCF IFS) is shorter than the 
DIFS. This means that if, at the same time, a station wants to use only DCF and an AP wants 
to use PCF, the AP has priority. 

e Due to the priority of PCF over DCF, stations that only use DCF may not gain access to the 
medium. To prevent this, a repetition interval has been designed to cover both contention- 
free (PCF) and contention-based (DCF) traffic. T 

e he repetition interval, which is repeated continuously, starts with a special control frame, 
called a beacon frame. When the stations hear the beacon frame, they start their NAV for the 
duration of the contention-free period of the repetition interval. The figure shows an example 


of a repetition interval. 
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B: Beacon frame Repetition interval 









CF: Contention-free Contention-free Contention 











SIFS 
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Fragmentation 
The wireless environment is very noisy; a corrupt frame has to be retransmitted. The protocol, 
therefore, recommends fragmentation--the division of a large frame into smaller ones. It is more 


efficient to resend a small frame than a large one. 


Frame Format 


The MAC layer frame consists of nine fields, as shown. 


2bytes 2bytes Gbytes Gbytes Gbytes 2bytes 6 bytes Oto2312 bytes bytes 





FC Address 1 | Address2 | Address 3 SC | Address 4 Frame body 


To |From|More More 
Slee 6 |e en a |] er 


2 bits 2 bits 4 bits Tbit Tbit Tbit Tbit Tbit Tbit 1bit 1 bit 









Protocol 
version 








e Frame control (FC). The FC field is 2 bytes long and defines the type of frame and some 


control information. The table describes the subfields. 
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Field Explanation 
Version Current version is 0 

Type Type of information: management (00), control (01), or data (10) 
Subtype Subtype of each type (see Table 14.2) 

To DS Defined later 

From DS Defined later 

More flag When set to 1, means more fragments 

Retry When set to 1, means retransmitted frame 

Pwr mgt When set to 1, means station is in power management mode 
More data When set to 1, means station has more data to send 

WEP Wired equivalent privacy (encryption implemented) 

Rsvd Reserved 














e ID. In all frame types except one, this field defines the duration of the transmission that is 
used to set the value of NAV. In one control frame, this field defines the ID of the frame. 

e Addresses. There are four address fields, each 6 bytes long. The meaning of each address 
field depends on the value of the To DS and From DS subfields and will be discussed later. 

© Sequence control. This field defines the sequence number of the frame to be used in flow 
control. 

e Frame body. This field, which can be between 0 and 2312 bytes, contains information based 
on the type and the subtype defined in the FC field. 

e FCS. The FCS field is 4 bytes long and contains a CRC-32 error detection sequence. 


Frame Types 

A wireless LAN defined by IEEE 802.11 has three categories of frames: management frames, 

control frames, and data frames. 

e Management Frames Management frames are used for the initial communication between 
stations and access points. 

© Control Frames Control frames are used for accessing the channel and acknowledging 


frames. The figure shows the format. 





2bytes 2bytes 6 bytes 6 bytes 4 bytes 2bytes 2bytes 6 bytes 4 bytes 
FC Lo] Address 1 | Address 2 Lis | 
RTS CTS or ACK 
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For control frames the value of the type field is 01; the values of the subtype fields for frames we 


have discussed are shown in the table. 

















Subtype Meaning 
1011 Request to send (RTS) 
1100 Clear to send (CTS) 
1101 Acknowledgment (ACK) 














e Data Frames Data frames are used for carrying data and control information. 


Addressing Mechanism 


The IEEE 802.11 addressing mechanism specifies four cases, defined by the value of the two 
flags in the FC field, To DS and From DS. Each flag can be either 0 or 1, resulting in four 
different situations. The interpretation of the four addresses (address 1 to address 4) in the MAC 


frame depends on the value of these flags, as shown in the table. 

















To From Address Address Address Address 
DS DS iI 2 3 4 

0 0 Destination Source BSS ID N/A 

0 1 Destination Sending AP Source N/A 

1 0 Receiving AP Source Destination N/A 

1 1 Receiving AP Sending AP Destination Source 


























Note that address | is always the address of the next device. Address 2 is always the address of 
the previous device. Address 3 is the address of the final destination station if it is not defined by 


address 1. Address 4 is the address of the original source station if it is not the same as address 2. 


Case 1:00 In this case, To DS = 0 and From DS = 0. This means that the frame is not going to a 
distribution system (To DS = 0) and is not coming from a distribution system (From DS = 0). 
The frame is going from one station in a BSS to another without passing through the distribution 


system. The ACK frame should be sent to the original sender. The addresses are shown in figure. 
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Case 2:01 In this case, To DS = 0 and From DS = 1. This means that the frame is coming from a 
distribution system (From DS = 1). The frame is coming from an AP and going to a station. The 
ACK should be sent to the AP. The addresses are as shown in Figure 14.9. Note that address 3 


contains the original sender of the frame (in another BSS). 


Case 3:10 In this case, To DS = 1 and From DS = 0. This means that the frame is going to a 
distribution system (To DS = 1). The frame is going from a station to an AP. The ACK is sent to 
the original station. The addresses are as shown in figure above. Note that address 3 contains the 


final destination of the frame (in another BSS). 


Case 4:11 In this case, To DS = 1 and From DS = 1. This is the case in which the distribution 
system is also wireless. The frame is going from one AP to another AP in a wire-less distribution 
system. We do not need to define addresses if the distribution system is a wired LAN because the 
frame in these cases has the format of a wired LAN frame (Ethernet, for example). Here, we 
need four addresses to define the original sender, the final destination, and two intermediate 


APs. Figure above shows the situation. 
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Hidden and Exposed Station Problems 
Hidden Station Problem 


The figure below shows an example of the hidden station problem. 





Station B has a transmission range shown by the left oval (sphere in space); every station in this 
range can hear any signal transmitted by station B. 

Station C has a transmission range shown by the right oval (sphere in space); every station 
located in this range can hear any signal transmitted by C. 

Station C is outside the transmission range of B; likewise, station B is outside the transmission 
range of C. 

Station A, however, is in the area covered by both B and C; it can hear any signal transmitted by 
BorC. 

Assume that station B is sending data to station A. In the middle of this transmission, station C 
also has data to send to station A. However, station C is out of B's range and transmissions from 
B cannot reach C. Therefore C thinks the medium is free. Station C sends its data to A, which 
results in a collision at A because this station is receiving data from both B and C. In this case, 
we say that stations B and C are hidden from each other with respect to A. 

Hidden stations can reduce the capacity of the network because of the possibility of collision. 
The solution to the hidden station problem is the use of the handshake frames (RTS and CTS) 
The figure shows that the RTS message from B reaches A, but not C. However, because both B 
and C are within the range of A, the CTS message, which contains the duration of data 
transmission from B to A reaches C. 

Station C knows that some hidden station is using the channel and refrains from transmitting 


until that duration is over. 
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Exposed Station Problem Now consider a situation that is the inverse of the previous one: the 
exposed station problem. In this problem a station refrains from using a channel when it is, in 
fact, available. In the figure, station A is transmitting to station B. Station C has some data to 
send to station D, which can be sent without interfering with the transmission from A to B. 
However, station C is exposed to transmission from 

A; it hears what A is sending and thus refrains from sending. In other words, C is too 
conservative and wastes the capacity of the channel. 


Range 
ofA 





The handshaking messages RTS and CTS cannot help in this case, despite what you might think. 
Station C hears the RTS from A, but does not hear the CTS from B. Station C, after hearing the 
RTS from A, can wait for a time so that the CTS from B reaches A; it then sends an RTS to D to 
show that it needs to communicate with D. Both stations B and A may hear this RTS, but station 
A is in the sending state, not the receiving state. Station B, however, responds with a CTS. The 
problem is here. If station A has started sending its data, station C cannot hear the CTS from 


station D because of the collision; it cannot send its data to D. It remains exposed until A finishes 
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Physical layer: 


















































IEEE Technique Band Modulation Rate (Mbps) 
802.11 FHSS 2.4 GHz FSK land 2 
DSSS 2.4 GHz PSK | and2 
Infrared PPM 1 and 2 
802.11la OFDM 5.725 GHz PSK or QAM 6 to 54 
802.11b DSSS 2.4 GHz PSK 5.5 and 11 
802.11g OFDM 2.4 GHz Different 22 and 54 
The 2.4GHz ISM band is divided into 79 bands of IMHz 
Industrial-Scientific-Medical (ISM) band 
26 83.5 125 
/MHz, ‘ MHz r F MHz F 
— ———— $+ 
902 928 24 2.4835 5.725 5.850 Frequency 
MHz MHz GHz GHz GHz GHz 
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+ In Frequency Hopping Spread Spectrum (FHSS) the sender sends on one carrier frequency 
for a short amount of time, then hops to another carrier frequency for the same amount of 
time, and so on. After N hop-pings, the cycle is repeated. 


+ Spreading makes it difficult for unauthorized persons to make sense of the transmitted data 





1 or 2 Mbps Modulator 1-MHz 


Digital Analog 
Frequency 
synthetizer 


data signal 
+ In Direct Sequence Spread Spectrum (DSSS) each bit sent by the sender is replaced by a 























sequence of bits called a chip code. 
+ To avoid buffering, the time needed to send one chip code must be the same as the time 
needed to send one original bit. 


+ DSSS is implemented at the physical layer and uses a 2.4GHz ISM band 







Modulator 









1 or2 Mbps 11-MHz 
Digital 11-Chip 11 of 22 Mbps Analog 
data Barker sequence signal 





+ TEEE 802.1 1a describes the orthogonal frequency-division multiplexing (OFDM) method for 
signal generation in the 5GHz ISM band 
+ OFDM is the same as FDM with one major difference: 
+ All the subbands are used by one source at a given time 
+ Sources conend with one another at the data link layer for access 


+ OFDM uses PSK (18Mbps) and QAM (54Mbps) for modulation 
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+ IEEE 802.11b describes the high-rate DSSS method for signal generation at 2.4GHz ISM 
band. 

+ This is similar to DSSS except for the encoding method, which is called complementary 
code keying (CCK) 

+ CCK encodes 4 or 8 bits to one CCK symbol 









5.5 Mbps: 2 bits 







cock 
selector 


5.5 or 11 Mbps: 6 bits 

11 Mbps tilssctel ete 11-MHz 

Digital Analog 
data > signal 











+ [IEEE 802.11 FHSS(Frequency-hopping spread spectrum) 
© Operating in 2.4 GHz ISM band 
e Lower cost, power consumption 


¢ Most tolerant to signal interference 


%* IEEE 802.11 DSSS (Direct-sequence spread spectrum) 
© Operating in 2.4 GHz ISM band 
e Supports higher data rates 
¢ More range than FH or IR physical layers 

* IEEE 802.11 Infrared 


e Lowest cost 
e Lowest range compared to spread spectrum 
e Doesn’t penetrate walls, so no eavesdropping 


* IEEE 802.1la 


° Makes use of 5-GHz band 

e Provides rates of 6, 9 , 12, 18, 24, 36, 48, 54 Mbps 

e Uses orthogonal frequency division multiplexing (OFDM) 
+ IEEE 802.11b 

© 802.11b operates in 2.4 GHz band 
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e Provides data rates of 5.5 and 11 Mbps 

¢ Complementary code keying (CCK) modulation scheme 
+ IEEE 802.11g 

© 802.1 1g operates in 2.4 GHz band 

e Provides data rates of 22 and 54 Mbps 

e Uses orthogonal frequency division multiplexing (OFDM) 


4.10 Bluetooth 


Bluetooth is a wireless LAN technology designed to connect devices of different functions such 

as telephones, notebooks, computers, cameras, printers, coffee makers, and so on. A Bluetooth 

LAN is an ad hoc network, which means that the network is formed spontaneously. 

Architecture 

Bluetooth defines two types of networks: piconet and scatternet. 

Piconet: 

+ A Bluetooth network is called a piconet, or a small net. 

+ Itcan have up to eight stations, one of which is called the master; the rest are called slaves. 

+ Maximum of seven slaves. Only one master. 

+ Slaves synchronize their clocks and hopping sequence with the master. 
But an additional eight slaves can stay in parked state, which means they can be 
synchronized with the master but cannot take part in communication until it is moved from 
the parked state 


Piconet 


Secondary Ps eA ss 


Scatternet 
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+ Piconets can be combined to form what is called a scatternet. 
+ Aslave station in one piconet can become the master in another piconet. 


+ Bluetooth devices has a built-in short-range radio transmitter. 





Piconet 
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Bluetooth layers 
+ Radio Layer: Roughly equivalent to physical layer of the Internet model. Physical links can 
be synchronous or asynchronous. 
+ Uses Frequency-hopping spread spectrum [Changing frequency of usage]. Changes it 
modulation frequency 1600 times per second. 
+ Uses frequency shift keying (FSK )with Gaussian bandwidth filtering to transform 
bits to a signal. 
+ Baseband layer: Roughly equivalent to MAC sublayer in LANs. Access is using Time 
Division (Time slots). 
+ Length of time slot = dwell time = 625 microsec. So, during one frequency, a sender 
sends a frame to a slave, or a slave sends a frame to the master. 
+ Time division duplexing TDMA (TDD-TDMA) is a kind of half-duplex 
communication in which the slave and receiver send and receive data, but not at the 
same time (half-duplex). However, the communication for each direction uses 


different hops, like walkie-talkies. 
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Baseband layer 





Single-secondary communication 
+ Master uses even-numbered slots 
* Slave uses odd-numbered slots 
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Multiple-secondary communication also called Multiple-slave communication 
+ Master uses even-numbered slots 
+ Slave sends in the next odd-numbered slot if the packet in the previous slot was 


addressed to it. 
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Physical Links 
+ Synchronous connection-oriented (SCO) 
+ Latency is important than integrity. 
+ Transmission using slots. 
* No retransmission. 
+ Asynchronous connectionless link (ACL) 
+ Integrity is important than latency. 
* Does like multiple-slave communication. 
* Retransmission is done. 
Frame format 











72 bits 54 bits Oto N bits 
Data 












N = 240 for 1-slot frame 
N = 1490 for 3-slot frame 


N = 2740 for 5-slot frame 


3bits 4bits 111 8 bits 
This 18-bit part is repeated 3 times. 
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Access code. This 72-bit field normally contains synchronization bits and the identifier of the 

primary to distinguish the frame of one piconet from another. 

Header. This 54-bit field is a repeated I8-bit pattern. Each pattern has the following 

subfields: 

1. Address. The 3-bit address subfield can define up to seven secondaries (1 to 7). If the 
address is zero, it is used for broadcast communication from the primary to all 
secondaries. 

2. Type. The 4-bit type subfield defines the type of data coming from the upper layers. 

3. F. This 1-bit subfield is for flow control. When set (I), it indicates that the device is 
unable to receive more frames (buffer is full). 

A. This 1-bit subfield is for acknowledgment. Bluetooth uses Stop-and-Wait ARQ; | 
bit is sufficient for acknowledgment. 

4. S. This 1-bit subfield holds a sequence number. Bluetooth uses Stop-and-Wait ARQ; 1 
bit is sufficient for sequence numbering. 

5. HEC. The 8-bit header error correction subfield is a checksum to detect errors in each 18- 
bit header section. 

The header has three identical 18-bit sections. The receiver compares these three sections, bit by bit. 

If each of the corresponding bits is the same, the bit is accepted; if not, the majority opinion rules. 

This is a form of forward error correction (for the header only). This double error control is needed 

because the nature of the communication, via air, is very noisy. Note that there is no retransmission in 

this sublayer. 

Payload. This subfield can be 0 to 2740 bits long. It contains data or control information 


corning from the upper layers. 


L2CAP (Logical Link Control and Adaptation Protocol) 


+ Equivalent to LLC sublayer in LANs. 
+ Used for data exchange on ACL Link. SCO channels do not use L2CAP. 
+ Frame format has 16-bit length [Size of data coming from upper layer in bytes], channel 


ID, data and control. 
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+ Can do Multiplexing, segmentation and Reassembly, QoS [with no QoS, best-effort 
delivery is provided] and Group mangement [Can do like multicast group, using some 


kind of logical addresses]. 


L2CAP data packet format 


2 bytes 2 bytes 0 to 65,535 bytes 














‘Channel Dd) Data and control 
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Module — 3 
NETWORK LAYER 


Structure of a Router 


A high-level view of a generic router architecture is shown below. Four router components can 


be identified: 
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e Input ports: An input port performs several key functions. 

It performs the physical layer function of terminating an incoming physical link at a router. 
An input port also performs link-layer functions needed to interoperate with the link layer at 
the other side of the incoming link. 

The lookup function is also performed at the input port. 

Forwarding table is consulted to determine the router output port to which an arriving packet 
will be forwarded via the switching fabric. 

e Switching fabric: The switching fabric connects the router’s input ports to its output ports. 

e Output port:. An output port stores packets received from the switching fabricand transmits 
these packets on the outgoing link by performing the necessary link-layer and physical-layer 
functions. 

e Routing processor: The routing processor executes the routing protocols maintains routing 
tables and attached link state information, and computes the forwarding table for the router. It 


also performs the network management functions. 
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A router’s input ports, output ports, and switching fabric together implement the forwarding 
function and are almost always implemented in hardware. These forwarding functions are 
sometimes collectively referred to as the router forwarding plane. 

Router control plane functions are usually implemented in software and execute on the 


routing processor 


Input Processing 














Data link Lookup, fowarding, 
equnetion lel protocol |_| queuing 
decapsulation) wrettretttty 























The input port’s line termination function and link-layer processing implement the physical 
and link layers for that individual input link. 

The lookup performed in the input port is central to the router’s operation—it is here that the 
router uses the forwarding table to look up the output port to which an arriving packet will be 
forwarded via the switching fabric. 

The forwarding table is computed and updated by the routing processor, with a shadow copy 
typically stored at each input port. The forwarding table is copied from the routing processor 
to the line cards over a separate bus. 

Once a packet’s output port has been determined via the lookup, the packet can be sent into 
the switching fabric. In some designs, a packet may be temporarily blocked from entering the 
switching fabric if packets from other input ports are currently using the fabric. A blocked 
packet will be queued at the input port and then scheduled to cross the fabric at a later point 


in time. 


Switching 


The switching fabric switches the packet from an input port to an output port. Switching can be 


accomplished in a number of ways: 
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1. Switching via memory: 

e The simplest, earliest routers were traditional computers, with switching between input 
and output ports being done under direct control of the CPU (routing processor). 

e Input and output ports functioned as traditional I/O devices in a traditional operating 
system. 

e An input port with an arriving packet first signaled the routing processor via an interrupt. 

e The packet was then copied from the input port into processor memory. 

e The routing processor then extracted the destination address from the header, looked up 
the appropriate output port in the forwarding table, and copied the packet to the output 
port’s buffers. 

e Here two packets cannot be forwarded at the same time, even if they have different 
destination ports, since only one memory read/write over the shared system bus can be 
done at a time. 

e Many modern routers switch via memory. A major difference from early routers is that 
the lookup of the destination address and the storing of the packet into the appropriate 


memory location are performed by processing on the input line cards. 






























































2. Switching via a bus: 

e In this approach, an input port transfers a packet directly to the output port over a shared 
bus, without intervention by the routing processor. 

e This is typically done by having the input port pre-pend a switch-internal label (header) 
to the packet indicating the local output port to which this packet is being transferred and 
transmitting the packet onto the bus. 

e The packet is received by all output ports, but only the port that matches the label will 
keep the packet. 

e The label is then removed at the output port, as this label is only used within the switch to 


cross the bus. 
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e If multiple packets arrive to the router at the same time, each at a different input port, all 


but one must wait since only one packet can cross the bus at a time. 


A x 
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3. Switching via an interconnection network: 

e One way to overcome the bandwidth limitation of a single, shared bus is to use a more 
sophisticated interconnection network, such as those that have been used in the past to 
interconnect processors in multiprocessor computer architecture. 

e A crossbar switch is an interconnection network consisting of 2N buses that connect N 
input ports to N output ports. 

e Each vertical bus intersects each horizontal bus at a crosspoint, which can be opened or 


closed at any time by the switch fabric. 




































































When a packet arrives from port A and needs to be forwarded to port Y, the switch controller 
closes the crosspoint at the intersection of busses A and Y, and port A then sends the packet 
onto its bus, which is picked up (only) by bus Y. Note that a packet from port B can be 
forwarded to port X at the same time, since the A-to-Y and B-to-X packets use different 
input and output busses. Thus, unlike the previous two switching approaches, crossbar 


networks are capable of forwarding multiple packets in parallel. However, if two packets 
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from two different input ports are destined to the same output port, then one will have to wait 


at the input, since only one packet can be sent over any given bus at a time. 


Output Processing 
Output port processing takes packets that have been stored in the output port’s memory and 
transmits them over the output link. This includes selecting and de-queuing packets for 


transmission, and performing the needed link layer and physical-layer transmission functions. 














/ Queuing (buffer Data link 

‘Switch management) processing Line 
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Queuing 

e Packet queues may form at both the input ports and the output ports. 

¢ The location and extent of queuing will depend on the traffic load, the relative speed of the 
switching fabric, and the line speed. 

e¢ When many packets arrive from same source at a faster rate than switching rate queuing at 
input port occurs. 

e When many packets are destined towards same output port queuing at output port occurs. 


Output port contention at time t 
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e¢ A consequence of output port queuing is that a packet scheduler at the output port must 
choose one packet among those queued for transmission. 

e Many packet scheduling algorithms like first-come-first-served (FCFS) scheduling, priority 
queuing, fair queuing or a more sophisticated scheduling discipline such as weighted fair 
queuing (WFQ) is available. 

e Packet scheduling plays a crucial role in providing quality-of-service guarantees. 

e Similarly, if there is not enough memory to buffer an incoming packet, a decision must be 
made to either drop the arriving packet (a policy known as drop-tail) or remove one or more 
already-queued packets to make room for the newly arrived packet. 

e In some cases, it may be advantageous to drop (or mark the header of) a packet before the 
buffer is full in order to provide a congestion signal to the sender. 

e A number of packet-dropping and -marking policies (which collectively have become known 
as active queue management (AQM) algorithms) have been proposed and analyzed. 

© One of the most widely studied and implemented AQM algorithms is the Random Early 
Detection (RED) algorithm. Under RED, a weighted average is maintained for the length of 
the output queue. 

e If the average queue length is less than a minimum threshold, minn, when a packet arrives, 
the packet is admitted to the queue. 

¢ Conversely, if the queue is full or the average queue length is greater than a maximum 
threshold, maxth, when a packet arrives, the packet is marked or dropped. 

e Finally, if the packet arrives to find an average queue length in the interval [minn, maxi], the 
packet is marked or dropped with a probability that is typically some function of the average 


queue length, minm, and maxin. 


Consider the following scenario: 

Suppose that in the below figure the switch fabric chooses to transfer the packet from the front of 
the upper-left queue. In this case, the darkly shaded packet in the lower-left queue must wait. But 
not only must this darkly shaded packet wait, so too must the lightly shaded packet that is queued 
behind that packet in the lower-left queue, even though there is no contention for the middle- 
right output port (the destination for the lightly shaded packet). This phenomenon is known as 
head-of-the-line (HOL) blocking. 
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Output port contention at time t— 
‘one dark packet can be transferred 
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The Internet Protocol (IP) 


Internet addressing and forwarding are important components of the Internet Protocol (IP). 


There are two versions of IP in use today: IPv4, IPv6. 


Datagram Format 





32 bits 
1 
r 1 
version {Eader type of service Datagram length (bytes) 
16-bit Identifier Flags 13-bit Fragmentation offset 
Time-to-live Uppes-iaver Header checksum. 


protocol 


32-bit Source IP address 
32-bit Destination IP address 
Options (if any) 


Data 
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Version number: These 4 bits specify the IP protocol version of the datagram. 

Header length: Because an IPv4 datagram can contain a variable number of options these 4 
bits specify the total header bytes. 

Type of service: The type of service (TOS) bits were included in the IPv4 header to allow 
different types of IP datagrams (for example, datagrams particularly requiring low delay, 
high throughput, or reliability) to be distinguished from each other. 

Datagram length: This is the total length of the IP datagram (header plus data), measured in 
bytes. 

Identification: this field represents identification number assigned to related fragments. 
Flag: there are three flags, first bit is unused, second bit is do not fragment bit, If this bit is 
set intermediate nodes should not perform fragmentation. Last bit is more fragment bit. More 
fragment bit represents more fragments to follow after this. 

Fragmentation offset: Starting byte of a fragment (In multiples of 8 bytes). 

Time-to-live: The time-to-live (TTL) field is included to ensure that datagrams do not 
circulate forever in the network. It represents hop limit. 

Protocol: Represents the upper layer protocol, 6 for TCP, 17 for UDP. 

Header checksum: field is used for error detection. 

Source and destination IP addresses: represents 32 bit source and destination IP address. 
Options: The options fields allow an IP header to be extended. It allows to include additional 
functionalities. 


Data (payload): represents the data that has to be transmitted. 


IP Datagram Fragmentation 


The maximum amount of data that a link-layer frame can carry is called the maximum 
transmission unit (MTU). Because each IP datagram is encapsulated within the link-layer 
frame for transport from one router to the next router, the MTU of the link-layer protocol 
places a hard limit on the length of an IP datagram. 

IP datagram is divided into smaller packets according to MTU. These smaller packets are 
called fragments and process is called fragmentation. 


Fragments need to be reassembled before they reach the transport layer at the destination. 
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e Receiver needs to reassemble all the fragments belong to same original IP datagram. In order 
to identify all the related fragments Identification field is used. 

e There are three flags, first bit is unused, second bit is do not fragment bit, If this bit is set 
intermediate nodes should not perform fragmentation. Last bit is more fragment bit. More 
fragment bit represents more fragments to follow after this. 

e In order for the destination host to determine whether a fragment is missing the offset field is 
used to specify where the fragment fits within the original IP datagram. 

Example: 

A datagram of 4,000 bytes (20 bytes of IP header plus 3,980 bytes of IP payload) arrives at a 

router and must be forwarded to a link with an MTU of 1,500 bytes. This implies that the 3,980 

data bytes in the original datagram must be allocated to three separate fragments. Suppose that 

the original datagram is stamped with an identification number of 777. Following table shows 


the fragmentation. 








Fragment Bytes 0 Offset Flog 
Ist fragment ‘1,480 bytes in identification = 777 offset = 0 (meaning the dato flag = 1 (meaning 
the data field of should be inserted beginning there is more) 
the IP datogram ot byte 0) 
‘2nd frogment ~—‘1,480 bytes identification = 777 offset = 185 (meaning the dato flog = 1 (meoning 
of data should be inserted beginning at byte there is more) 
1,480. Note that 185 - 8 = 1,480) 
31d frogment ‘1,020 bytes identification = 777 offset = 370 (meaning the data flag = 0 (meaning this 
(= 3,980-1,480-1,480) should be inserted beginning at byte is the last fragment) 
of data 2,960. Note that 370 - 8 = 2,960) 





At the destination, the payload of the datagram is passed to the transport layer only after the IP 
layer has fully reconstructed the original IP datagram. If one or more of the fragments does not 
arrive at the destination, the incomplete datagram is discarded and not passed to the transport 


layer. 


IPv4 Addressing 
e Each IP address is 32 bits long (equivalently, 4 bytes), and there are thus a total of 232 


possible IP addresses. Approximately there are about 4 billion possible IP addresses. 
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e IP addresses are typically written in so-called dotted-decimal notation, in which each byte of 
the address is written in its decimal form and is separated by a period (dot) from other bytes 
in the address. 

e Ex: 193.32.216.9 

e The address 193.32.216.9 in binary notation is 11000001 00100000 11011000 00001001 

¢ ch interface on every host and router in the global Internet must have an IP address that is 
globally unique. 

e IP address has 2 parts: Network ID and Host ID. Network ID is used to identify the network 
and Host ID is used to identify host in the network. 

e A network can be divided into smaller sub networks called subnets. 

e Initially IP address was divided into 5 classes. We call this as classful addressing. It leads to 
shortage of IP Address. Now Classless Inter Domain Routing (CIDR) addressing is used. In 
CIDR IP address is represented as a.b.c.d/x where x represents number of bits used for 
Network ID. 

e Example. 

If 256 hosts are there last 8 bit is allocated to host ID remaining 24 bit is allocated to network 
ID. Here /x value is /24. 


Below figure shows sub netting. 


223.1.1.1 a. 
223.114 223.129 223.1.2.1 
\eeqZ 
[~~ 223.1.3.27 
223.1.1.2 eo 


223.1.2.2 
223.1.1.3 


223.1.3.1 223.1.3.2 
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Dynamic Host Configuration Protocol 

¢ Once an organization has obtained a block of addresses, it can assign individual IP addresses 
to the host and router interfaces in its organization. 

e A-system administrator will typically manually configure the IP addresses into the. 

e Host addresses can also be configured manually, but more often this task is now done using 
the Dynamic Host Configuration Protocol (DHCP). 

e DHCP allows a host to obtain (be allocated) an IP address automatically. 


e DHCP works over UDP with port number 67. 


DHCP involves four steps: 

1) DHCP server discovery 

Host broadcast DHCP discovery message with source address 0.0.0.0 and destination address 
255.255.255.255 

2) DHCP server offer(s) 

A DHCP server receiving a DHCP discover message responds to the client with a DHCP offer 
message that is broadcast to all nodes on the subnet, again using the IP broadcast address of 
255.255.255.255. 

3) DHCP request 

The newly arriving client will choose from among one or more server offers and respond to its 
selected offer with a DHCP request message. 

4) DHCP ACK 

The server responds to the DHCP request message with a DHCP ACK message, confirming the 


requested parameters. 
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DHC? server: Arriving clien 


223.1.2.5 

















DHCP discover 
sc 0.0.0.0, 68 
Gest: 255.255.255.255,67 
DHCPDISCOVER 
idr: 0.0.0.0 
pes ID: 654 DHCP offer 
sc 223.125, 67 
Gest: 255.255.255.255,68 
DHCPOFFER 
yiaddrr: 223.1.2.4 
‘transaction ID: 654 
DHCP request ieee, so 




















src: 0.0.0.0, 68 
dest: 255.255.255.255, 67] 










DHCP server ID: 223.1.2.5 
lifetime: 3600 secs 







DHCP ACK 


src 223.12.5, 67 
dest: 255.255.255.255,68 





DHCP server ID: 223.1.2.5 
Lifetime: 3600 secs: 


ag 


Time Time 











Network Address Translation (NAT) 

¢ Private IP addresses shown below are used inside company/campus/organization or home 
network. 
10.0.0.0 to 10.255.255.255 
171.16.0.0 to 171.31.255.255 
192.168.0.0 to 192.168.255.255 

e But company/campus/organization or home network connect to internet through global IP 
address. 

e To convert from private IP address to global IP address and vice-versa NAT is used. 


e NAT can be illustrated with the following diagram. 





Archana R A, Asso. Prof., Dept. of CSE, SVIT 12 


Module — 3: Network Layer 








NAT translation table 





WAN side LAN side 
138.76.29.7, 5001 10.0.0.1, 3345 

















| S= [s=t0001, 395” +3345 


D = 128.119.40.186, 80 — 2 
‘3 =138.76.29.7, 5001 
D=128.119.40.186, 80_, 
10.0.0. 4 10.0.0.2 
138.76.29.7 7S 
Sa KO— a eee 
D=10.0.0.1 
pre 3 


e Here the host with IP 10.0.0.1 sends the IP datagram with source address, port number 
10.0.0.1, 3345 and destination address, port number 128.119.40.186, 80. 














e NAT router maintains a NAT table as shown above 

e NAT router make an entry in its table and replaces the source IP address with global IP 
address 138.76.29.7 and port number 5001 and send it to destination. 

«¢ When the response comes from destination, the global IP address will be replaced by private 
IP address according to entry available in NAT table. Then the message is delivered to 


appropriate host. 


UPnP 

e NAT traversal is increasingly provided by Universal Plug and Play (UPnP), which is a 
protocol that allows a host to discover and configure a nearby NAT. 

e UPnP requires that both the host and the NAT be UPnP compatible. 

e With UPnP, an application running in a host can request a NAT mapping between its ( 
private IP address, private port number) and the ( public IP address, public port number) for 
some requested public port number. 

e If the NAT accepts the request and creates the mapping, then nodes from the outside can 
initiate TCP connections to (public IP address, public port number). 

e Furthermore, UPnP lets the application know the value of ( public IP address, public port 


number), so that the application can advertise it to the outside world. 
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Internet Control Message Protocol (ICMP) 


ICMP is used by hosts and routers to communicate network- layer information to each other. 
The most typical use of ICMP is for error reporting. 

ICMP is part of IP but architecturally it lies just above IP hence ICMP messages are carried 
inside IP datagrams. That is, ICMP messages are carried as IP payload, just as TCP or UDP 
segments are carried as IP payload. 

ICMP messages have a type and a code field, and contain the header and the first 8 bytes of 
the IP datagram that caused the ICMP message to be generated in the first place. 


Popular ICMP messages are listed below 





ICNP Type (ode Description 





echo reply (to ping) 





destination network unreachable 





destination host unreachable 





destination protocol unreachable 





destination port unreachable 





destination network unknown 





destination host unknown 





source quench (congestion control) 





echo request 





wolola|[wlwlwlwlwlwlo 


router odvertisement 





router discovery 





1 TIL expired 





elolcofolololraf[alwl[rw|—-lolo 


12 IP header bod 





The well-known ping program sends an ICMP type 8 code 0 message to the specified host. 
The destination host, seeing the echo request, sends back a type 0 code 0 ICMP echoreply. 
Congested router send an ICMP source quench message to a host toforce that host to reduce 


its transmission rate. 
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IPv6 
IPv6 Datagram Format 
32 bits 
L 
i 1 
Version _ Traffic class Flow label 
Payload length Next hdr Hop limit 
Source address 
(128 bits) 
Destination address 
(128 bits) 
Data 


The most important changes introduced in IPv6 are: 


Expanded addressing capabilities: IPv6 increases the size of the IP address from 32 to 128 
bits. 

A streamlined 40-byte header: 40 bytes of mandatory header is used in IPv6 whereas IPv4 
uses 20 bytes of mandatory header. 

Flow labeling and priority: Flow label refers to labeling of packets belonging to particular 
flows for which the sender requests special handling, such as a non default quality of service 
or real-time service. The IPv6 header also has an 8-bit traffic class field. This field, like the 


TOS field in IPv4, can be used to give priority to certain datagrams within a flow. 


The following fields are defined in IPv6: 


Version: This 4-bit field identifies the IP version number. 

Traffic class: This 8-bit field specify priority. 

Flow label: this 20-bit field is used to identify a flow of datagrams. 

Payload length: This 16-bit value is treated as an unsigned integer giving the number of 
bytes in the IPv6 datagram following the fixed-length, 40-byte datagram header. 

Next header: This field identifies the next following header. 

Hop limit: The contents of this field are decremented by one by each router that forwards the 
datagram. If the hop limit count reaches zero, the datagram is discarded. 


Source and destination addresses: 128 bit IPv6 address. 
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Data: This is the payload portion of the IPv6 datagram. 


Following fields appearing in the IPv4 datagram are no longer present in the IPv6 datagram: 


Fragmentation/Reassembly: IPv6 does not allow for fragmentation and reassembly at 
intermediate routers; these operations can be performed only by the source and destination. If 
an IPv6 datagram received by a router is too large to be forwarded over the outgoing link, the 
router simply drops the datagram and sends a “Packet Too Big” ICMP error message (see 
below) back to the sender. The sender can then resend the data, using a smaller IP datagram 
size. 

Header checksum: Because the transport-layer and link-layer protocols in the Internet layers 
perform check-summing, the designers of IP probably felt that this functionality was 
sufficiently redundant in the network layer that it could be removed. 

Options: An options field is no longer a part of the standard IP header. Instead of option 


field extension headers are used. 


Transitioning from IPv4 to IPv6 
1) Dual-stack: 


Here IPv6 nodes also have a complete IPv4 implementation. Such a node, has the ability to 
send and receive both IPv4 and IPv6 datagrams. When interoperating with an IPv4 node, an 
IPv6/IPv4 node can use IPv4 datagrams; when interoperating with an IPv6 node, it can speak 
IPv6. IPv6/IPv4 nodes must have both IPv6 and IPv4 addresses. They must furthermore be 
able to determine whether another node is IPv6-capable or IPv4-only. 

In the dual-stack approach, if either the sender or the receiver is only IPv4- capable, an IPv4 
datagram must be used. As a result, it is possible that two IPv6- capable nodes can end up, in 


essence, sending IPv4 datagrams to each other. 





ei— eS Ipva IPva IPvé 

= = ge = 
Flow: X Source: A Source: A Flow: 2? 
‘Source: A Dest: F Dest: F Source: A 
Dest: F Dest: F 

data data 

data data 
AtoB:IPv6 — Bto C: IPv4 DtoEIPv4 Eto F: IPv6 
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Example: Suppose Node A is IPv6-capable and wants to send an IP datagram to Node F, which 
is also IPv6-capable. Nodes A and B can exchange an IPv6 datagram. However, Node B must 
create an IPv4 datagram to send to C. Certainly, the data field of the IPv6 datagram can be 
copied into the data field of the IPv4 datagram and appropriate address mapping can be done. 
However, in performing the conversion from IPv6 to IPv4, there will be IPv6-specific fields in 
the IPv6 datagram that have no counterpart in IPv4. The information in these fields will be lost. 
Thus, even though E and F can exchange IPv6 datagrams, the arriving IPv4 datagrams at E from 


D do not contain all of the fields that were in the original IPv6 datagram sent from A. 


2) Tunneling 

Tunneling can solve the problem noted above. The basic idea behind tunneling is the following. 
Suppose two IPv6 nodes want to interoperate using IPv6 datagrams but are connected to each 
other by intervening IPv4 routers. We refer to the intervening set of IPv4 routers between two 
IPv6 routers as a tunnel. With tunneling, the IPv6 node on the sending side of the tunnel takes 
the entire IPv6 datagram and puts it in the data (payload) field of an IPv4 datagram. 


Logical view 
IPV6 IPV6 IPv6 IPV6 


GS—_a8 eS —_ea 


Physical view 








Iva Ipve. 
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IP Security 

IPsec is the security protocol used for IP security. The services provided by an IPsec session 
include: 

e Cryptographic agreement: Mechanisms that allow the two communicating hosts to agree 


on cryptographic algorithms and keys. 
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Encryption of IP datagram payloads: When the sending host receives a segment from the 
transport layer, IPsec encrypts the payload. The payload can only bedecrypted by IPsec in the 
receiving host. 

Data integrity: IPsec allows the receiving host to verify that the datagram’s header fields 
and encrypted payload were not modified while the datagram was en route from source to 
destination. 

Origin authentication: When a host receives an IPsec datagram from a trusted source, the 


host is assured that the source IP address in the datagram is the actual source of the datagram. 


Routing Algorithms 


The purpose of a routing algorithm is: given a set of routers, with links connecting the 
routers, a routing algorithm finds a “good” path from source router to destination router. 
Typically, a good path is one that has the least cost. 

Typically a host is attached directly to one router, the default router for the host. Whenever a 
host sends a packet, the packet is transferred to its default router. 

A graph is used to formulate routing problems. A graph G = (N,E) is a set N of nodes and a 
collection E of edges, where each edge is a pair of nodes from N. In the context of network- 
layer routing, the nodes in the graph represent routers and the edges connecting these nodes 
represent the physical links between these routers. 

An edge also has a value representing its cost. Typically, an edge’s cost may reflect the 
physical length of the corresponding link, the link speed, or the monetary cost associated 


with a link. 
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Classification: 


A global routing algorithm computes the least-cost path between a source and destination 
using complete, global knowledge about the network. That is, the algorithm takes the 
connectivity between all nodes and all link costs as inputs. This then requires that the 
algorithm somehow obtain this information before actually performing the calculation. In 
practice, algorithms with global state information are often referred to as link-state (LS) 
algorithms, since the algorithm must be aware of the cost of each link in the network. 

In a decentralized routing algorithm, the calculation of the least-cost path is carried out in 
an iterative, distributed manner. No node has complete information about the costs of all 
network links. Instead, each node begins with only the knowledge of the costs of its own 
directly attached links. Then, through an iterative process of calculation and exchange of 
information with its neighboring nodes, a node gradually calculates the least-cost path to a 
destination or set of destinations. 

In static routing algorithms, routes change very slowly over time, often as a result of 
human intervention. 

Dynamic routing algorithms change the routing paths as the network traffic loads or 
topology change. 

In a load-sensitive algorithm, link costs vary dynamically to reflect the current level of 


congestion in the underlying link. 


The Link-State (LS) Routing Algorithm 


In a link-state algorithm, the network topology and all link costs are known, that is, available as 


input to the LS algorithm. 


In practice this is accomplished by having each node broadcast link-state packets to all other 


nodes in the network, with each link-state packet containing the identities and costs of its 


attached links. 


Link state uses Dijkstra’s algorithm for finding shortest path. 


Notation: 


+ Dv): cost of the least-cost path from the source node to destination v as of this iteration of the 


algorithm. 


* p(v): previous node (neighbor of v) along the current least-cost path from the source to v. 
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+ N’ : subset of nodes; v is in N’ if the least-cost path from the source to v is definitively known. 
Algorithm: 
1 Initialization: 
2 N' = {u} 
3 for all nodes v 
4 if v is a neighbor of u 
5 then D(v) = c(u,v) 
6 else D(v) = = 
7 
8 


Loop 
9 find w not in N’ such that D(w) is a minimum 
10 add w to N’ 
11 update D(v) for each neighbor v of w and not in N’: 


12 D(v) = min( D(v), D(w) + c(w,v) ) 
13 /* new cost to v is either old cost to v or known 
14 least path cost to w plus cost from w to v */ 


15 until N’'= N 
Note: for problems Refer Class Notes 


Example: 
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The Distance-Vector (DV) Routing Algorithm 


Whereas the LS algorithm is an algorithm using global information, the distance vector (DV) 
algorithm is iterative, asynchronous, and distributed. 
It is distributed in that each node receives some information from one or more of its directly 
attached neighbors, performs a calculation, and then distributes the results of its calculation 
back to its neighbors. 
It is iterative in that this process continues on until no more information is exchanged 
between neighbors. 
The algorithm is asynchronous in that it does not require all of the nodes to operate in 
lockstep with each other. 
Let dx(y) be the cost of the least-cost path from node x to node y. 
dy) = min, {e(x,v) + d,(y)} 

Each node x begins with Dx(y), an estimate of the cost of the least-cost path from itself to 
node y, for all nodes in N. 
Let Dx = [Dx(y): y in N] be node x’s distance vector, which is the vector of cost estimates 
from x to all other nodes, y, in N. 
With the DV algorithm, each node x maintains the following routing information: 
= For each neighbor v, the cost c(x,v) from x to directly attached neighbor, v 
= Node x’s distance vector, that is, Dx = [Dx(y): y in N], containing x’s estimate of its cost 

to all destinations, y, in N 
= The distance vectors of each of its neighbors, that is, Dv = [Dv(y): y in N] for each 


neighbor v of x 
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Distance-Vector (DV) Algorithm 
At each node, x: 


1 Initialization: 

2 for all destinations y in N: 

3 Dy) = ¢(x-y) /* if y is not a neighbor then c(x,y) = ~ */ 
4 for each neighbor w 

5 D,(y) = ? for all destinations y in N 

6 for each neighbor w 

7 send distance vector D, = [D,(y): y in N] tow 

8 


9 loop 
10 wait (until I see a link cost change to some neighbor w or 
ll until I receive a distance vector from some neighbor w) 
12 
13 for each y in N: 
14 D.(y) = min,{c(x,v) + D,(y)} 
15 
16 if D,(y) changed for any destination y 
17 send distance vector D, = [D,(y): y in N] to all neighbors 
18 
19 forever 
Example: 


Note: Refer Class notes for problem 
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Distance-Vector Algorithm: Link-Cost Changes and Link Failure: 
When a node running the DV algorithm detects a change in the link cost from itself to a 
neighbor, it updates its distance vector and, if there’s a change in the cost of the least-cost path, 


informs its neighbors of its new distance vector. 


Distance-Vector Algorithm: Adding Poisoned Reverse: 
In the above example if z routes through y to get to destination x, then z will advertise to y that 
its distance to x is infinity, that is, z will advertise to y that Dz(x) = © (even though z knows 


Da(x) = 5 in truth). z will continue telling this little white lie to y as long as it routes to x via y. 
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Since y believes that z has no path to x, y will never attempt to route to x via z, as long as z 


continues to route to x via y. 


A Comparison of LS and DV Routing Algorithms: 


Message complexity: LS requires O(INI IEl) messages to be sent. Also, whenever a link cost 
changes, the new link cost must be sent to all nodes. The DV algorithm requires message 
exchanges between directly connected neighbors at each iteration. When link costs change, 
the DV algorithm will propagate the results of the changed link cost only if the new link cost 
results in a changed least-cost path for one of the nodes attached to that link. 

Speed of convergence: LS is an O(INI2) algorithm requiring O(INI IEI)) messages. The DV 
algorithm can converge slowly and can have routing loops while the algorithm is converging. 
DV also suffers from the count-to-infinity problem. 

Robustness: Under LS, a router could broadcast an incorrect cost for one of its attached links 
(but no others). A node could also corrupt or drop any packets it received as part of an LS 
broadcast. But an LS node is computing only its own forwarding tables; other nodes are 
performing similar calculations for themselves. This means route calculations are somewhat 
separated under LS, providing a degree of robustness. Under DV, a node can advertise 


incorrect least-cost paths to any or all destinations. 


Hierarchical Routing 


In homogenous Routing all routers executes the same routing algorithm to compute routing paths 


through the entire network. Homogenous routing is very simple for the following reason: 


Scale: As the number of routers becomes large, the overhead involved in computing, storing, 
and communicating routing information becomes prohibitive. Today’s public Internet 
consists of hundreds of millions of hosts. Storing routing information at each of these hosts 
would clearly require enormous amounts of memory. The overhead required to broadcast LS 
updates among all of the routers in the public Internet would leave no bandwidth left for 
sending data packets! A distance-vector algorithm that iterated among such a large number of 
routers would surely never converge. Clearly, something must be done to reduce the 


complexity of route computation in networks as large as the public Internet. 
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e Administrative autonomy. Although researchers tend to ignore issues such as a company’s 
desire to run its routers as it please or to hide aspects of its network’s internal organization 
from the outside, these are important considerations. Ideally, an organization should be able 
to run and administer its network as it wishes, while still being able to connect its network to 
other outside networks. 

Both of these problems can be solved by organizing routers into autonomous systems (ASs), 

with each AS consisting of a group of routers that are typically under the same administrative 

control. 

Routers within the same AS all run the same routing algorithm and have information about each 

other. The routing algorithm running within an autonomous system is called an intra autonomous 


system routing protocol. 
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e An intra-AS routing protocol is used to determine how routing is performed within an 
autonomous system (AS). 

e Intra-AS routing protocols are also known as interior gateway protocols. 

e RIP isa distance-vector protocol that operates in a manner very close to the idealized DV 
protocol. 


e RIP uses hop count as a cost metric; that is, each link has a cost of 1. 


Destination Hops 


u v 
GS CH u 1 
~ v 2 
w 2 
x x 3 
aE y 3 

z 
Zz y 


e The maximum cost of a path is limited to 15, thus limiting the use of RIP to autonomous 
systems that are fewer than 15 hops in diameter. 

e In RIP, routing updates are exchanged between neighbors approximately every 30 seconds 
using a RIP response message. The response message sent by a router or host contains a list 
of up to 25 destination subnets within the AS, as well as the sender’s distance to each of 
those subnets. Response messages are also known as RIP advertisements. 

e Each router maintains a RIP table known as a routing table. A router’s routing table includes 
both the router’s distance vector and the router’s forwarding table. 

Example: 


Consider the following network: 








Below figure shows the routing table for Node D 
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Destination Subnet Next Router Number of Hops to Destination 
w A 2 
y B 2 
z B 7 
x - 1 








Node A sends the advertisement message as follows: 














Destination Subnet Next Router Number of Hops to Destination 
z C . 
w - 1 
x = 1 








When Router D receives the advertisement message Its routing table will be updated as shown 


below: 














Destination Subnet Next Router Number of Hops to Destination 
w A 2 
y B 2 
z A 5 








Intra-AS Routing in the Internet: OSPF 


¢ OSPF is a link-state protocol that uses flooding of link-state information and a Dijkstra least- 
cost path algorithm. 

e¢ With OSPF, a router constructs a complete topological map (that is, a graph) of the entire 
autonomous system. 

e The router then locally runs Dijkstra’s shortest-path algorithm to determine a shortest-path 
tree to all subnets, with itself as the root node. 

e Individual link costs are configured by the network administrator. The administrator might 


choose to set all link costs to 1, thus achieving minimum-hop routing, or might choose to set 
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the link weights to be inversely proportional to link capacity in order to discourage traffic 
from using low-bandwidth links. 
e With OSPF, a router broadcasts routing information to all other routers in the autonomous 
system, not just to its neighboring routers. 
e A router broadcasts linkstate information whenever there is a change in a link’s state. It also 
broadcasts a link’s state periodically (at least once every 30 minutes), even if the link’s state 
has not changed. 
e¢ OSPF advertisements are contained in OSPF messages that are carried directly by IP, with an 
upper-layer protocol of 89 for OSPF. 
e Some of the advances embodied in OSPF include the following: 
= Security: Exchanges between OSPF routers can be authenticated. With authentication, 
only trusted routers can participate in the OSPF protocol within an AS, thus preventing 
malicious intruders from injecting incorrect information into router tables. By default, 
OSPF packets between routers are not authenticated and could be forged. Two types of 
authentication can be configured—simple and MDS. With simple authentication, the 
same password is configured on each router. When a router sends an OSPF packet, it 
includes the password in plaintext. Clearly, simple authentication is not very secure. 
MDS authentication is based on shared secret keys that are configured in all the routers. 
For each OSPF packet that it sends, the router computes the MDS hash of the content of 
the OSPF packet appended with the secret key. Then the router includes the resulting 
hash value in the OSPF packet. The receiving router, using the preconfigured secret key, 
will compute an MDS hash of the packet and compare it with the hash value that the 
packet carries, thus verifying the packet’s authenticity. Sequence numbers are also used 
with MDS authentication to protect against replay attacks. 

= Multiple same-cost paths: When multiple paths to a destination have the same cost, 
OSPF allows multiple paths to be used. 

= Integrated support for unicast and multicast routing: Multicast OSPF (MOSPF) 
provides simple extensions to OSPF to provide for multicast routing. MOSPF uses the 
existing OSPF link database and adds a new type of link-state advertisement to the 


existing OSPF link-state broadcast mechanism. 
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= Support for hierarchy within a single routing domain: Perhaps the most significant 
advance in OSPF is the ability to structure an autonomous system hierarchically. 

e An OSPF autonomous system can be configured hierarchically into areas. Each area runs its 
own OSPF link-state routing algorithm, with each router in an area broadcasting its link state 
to all other routers in that area. 

e Within each area, one or more area border routers are responsible for routing packets outside 
the area. Lastly, exactly one OSPF area in the AS is configured to be the backbone area. 

e The primary role of the backbone area is to route traffic between the other areas in the AS. 
The backbone always contains all area border routers in the AS and may contain non border 
routers as well. 

e Inter-area routing within the AS requires that the packet be first routed to an area border 
router (intra-area routing), then routed through the backbone to the area border router that is 


in the destination area, and then routed to the final destination. 
Inter-AS Routing: BGP 
BGP provides each AS a means to 


1. Obtain subnet reachability information from neighboring ASs. 
2. Propagate the reachability information to all routers internal to the AS. 


3. Determine “good” routes to subnets based on the reachability information and on AS policy. 
BGP Basics: 


e¢ InBGP, pairs of routers exchange routing information over semi permanent TCP connections 
using port 179. 

e For each TCP connection, the two routers at the end of the connection are called BGP peers, 
and the TCP connection along with all the BGP messages sent over the connection is called a 
BGP session. 

e A BGP session that spans two ASs is called an external BGP (eBGP) session, and a BGP 


session between routers in the same AS is called an internal BGP (iBGP) session. 
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¢ BGP allows each AS to learn which destinations are reachable via its neighboring ASs. In 
BGP, destinations are not hosts but instead are CIDRized prefixes, with each prefix 
representing a subnet or a collection of subnets. 

e For example, suppose there are four subnets attached to AS2: 138.16.64/24, 138.16.65/24, 
138.16.66/24, and 138.16.67/24. Then AS2 could aggregate the prefixes for these four 
subnets and use BGP to advertise the single prefix to 138.16.64/22 to AS1. 

e Using the eBGP session between the gateway routers 3a and 1c, AS3 sends ASI the list of 
prefixes that are reachable from AS3; and ASI sends AS3 the list of prefixes that are 
reachable from AS1. Similarly, AS] and AS2 exchange prefix reachability information 
through their gateway routers |b and 2a. 

e When a gateway router (in any AS) receives eBGP-learned prefixes, the gateway router uses 
its iBGP sessions to distribute the prefixes to the other routers in the AS. Thus, all the routers 
in AS] learn about AS3 prefixes, including the gateway router 1b. 

e The gateway router 1b (in AS1) can therefore re-advertise AS3’s prefixes to AS2. When a 
router (gateway or not) learns about a new prefix, it creates an entry for the prefix in its 


forwarding table. 
Path Attributes and BGP Routes: 


e AS-PATH: This attribute contains the ASs through which the advertisement for the prefix 
has passed. When a prefix is passed into an AS, the AS adds its ASN to the ASPATH 
attribute. 

¢ The NEXT-HOP is the router interface that begins the AS-PATH. The NEXT-HOP attribute 


is used by routers to properly configure their forwarding tables. 
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¢ BGP also includes attributes that allow routers to assign preference metrics to the routes, and 


an attribute that indicates how the prefix was inserted into BGP at the origin AS. 
BGP Route Selection: 


¢ Routes are assigned a local preference value as one of their attributes. The local preference of 
a route could have been set by the router or could have been learned by another router in the 
same AS. This is a policy decision that is left up to the AS’s network administrator. The 
routes with the highest local preference values are selected. 

e From the remaining routes (all with the same local preference value), the route with the 
shortest AS-PATH is selected. 

e From the remaining routes the route with the closest NEXT-HOP router is selected. 


e If more than one route still remains, the router uses BGP identifiers to select the route. 


Broadcast and Multicast Routing: 

In broadcast routing, the network layer provides a service of delivering a packet sent from a 
source node to all other nodes in the network; multicast routing enables a single source node to 
send a copy of a packet to a subset of the other network nodes. 


Broadcast Routing Algorithms: 
Uncontrolled Flooding: 


e The most obvious technique for achieving broadcast is a flooding approach in which the 
source node sends a copy of the packet to all of its neighbors. 

e¢ When a node receives a broadcast packet, it duplicates the packet and forwards it to all of its 
neighbors. 

¢ Clearly, if the graph is connected, this scheme will eventually deliver a copy of the broadcast 
packet to all nodes in the graph. 

e Although this scheme is simple and elegant, it has a fatal flaw: If the graph has cycles, then 


one or more copies of each broadcast packet will cycle indefinitely. 
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Duplicate creation/transmission 








For example, in the above figure, R2 will flood to R3, R3 will flood to R4, R4 will flood to 
R2, and R2 will flood to R3, and so on. This simple scenario results in the endless cycling of 
two broadcast packets, one clockwise, and one counterclockwise. 

When a node is connected to more than two other nodes, it will create and forward multiple 
copies of the broadcast packet, each of which will create multiple copies of itself and so on. 
This broadcast storm, resulting from the endless multiplication of broadcast packets, would 
eventually result in so many broadcast packets being created that the network would be 


rendered useless. 


Controlled Flooding 


The key to avoiding a broadcast storm is for a node to judiciously choose when to flood a packet 


and when not to flood a packet. 


Sequence-number-controlled flooding: 


In sequence-number-controlled flooding, a source node puts its address (or other unique 
identifier) as well as a broadcast sequence number into a broadcast packet, then sends the 
packet to all of its neighbors. 

Each node maintains a list of the source address and sequence number of each broadcast 
packet it has already received, duplicated, and forwarded. 

When a node receives a broadcast packet, it first checks whether the packet is in this list. If 
so, the packet is dropped; if not, the packet is duplicated and forwarded to all the node’s 


neighbors (except the node from which the packet has just been received). 
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Reverse path forwarding or Reverse path broadcast: 


e A second approach to controlled flooding is known as reverse path forwarding (RPF) also 
sometimes referred to as reverse path broadcast (RPB). 

e When a router receives a broadcast packet with a given source address, it transmits the packet 
on all of its outgoing links (except the one on which it was received) only if the packet 
arrived on the link that is on its own shortest unicast path back to the source. 

e Otherwise, the router simply discards the incoming packet without forwarding it on any of its 
outgoing links. 

e Such a packet can be dropped because the router knows it either will receive or has already 


received a copy of this packet on the link that is on its own shortest path back to the sender. 


Example: 





~—> pkt will be forwarded 
—=1 pit not forwarded beyond receiving router 


e In below figure suppose that the links drawn with thick lines represent the least-cost paths 
from the receivers to the source (A). 

e Node A initially broadcasts a source-A packet to nodes C and B. 

© Node B will forward the source-A packet it has received from A to both C and D. 


e B will ignore any source-A packets it receives from any other nodes. 
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Let us now consider node C, which will receive a source-A packet directly from A as well as 
from B. Since B is not on C’s own shortest path back to A, C will ignore any source-A 
packets it receives from B. 

On the other hand, when C receives a source-A packet directly from A, it will forward the 


packet to nodes B, E, and F. 


Spanning-Tree Broadcast 


While sequence-number-controlled flooding and RPF avoid broadcast storms, they do not 
completely avoid the transmission of redundant broadcast packets. 

Ideally, every node should receive only one copy of the broadcast packet. Examining the tree 
consisting of the nodes connected by thick lines in the below figure, if broadcast packets 
were forwarded only along links within this tree, each and every network node would receive 


exactly one copy of the broadcast packet. 





When a source node wants to send a broadcast packet, it sends the packet out on all of the 
incident links that belong to the spanning tree. 

A node receiving a broadcast packet then forwards the packet to all its neighbors in the 
spanning tree. 

Here a node need not be aware of the entire tree; it simply needs to know which of its 
neighbors in G are spanning-tree neighbors. 

The main complexity associated with the spanning-tree approach is the creation and 
maintenance of the spanning tree. 

In the center-based approach to building a spanning tree, a center node is defined. 


Nodes then unicast tree-join messages addressed to the center node. 
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e A tree-join message is forwarded using unicast routing toward the center until it either arrives 
at a node that already belongs to the spanning tree or arrives at the center. 
e In either case, the path that the tree-join message has followed defines the branch of the 


spanning tree between the edge node that initiated the tree-join message and the center. 


Example: 





a. Stepwise construction of spanning tree b. Constructed spanning tree 


e Suppose that node E is selected as the center of the tree. Suppose that node F first joins the 
tree and forwards a tree-join message to E. The single link EF becomes the initial spanning 
tree. Node B then joins the spanning tree by sending its tree-join message to E. 

e Suppose that the unicast path route to E from B is via D. In this case, the tree-join message 
results in the path BDE being grafted onto the spanning tree. 

e Node A next joins the spanning group by forwarding its tree-join message towards E. If A’s 
unicast path to E is through B, then since B has already joined the spanning tree, the arrival 
of A’s tree-join message at B will result in the AB link being immediately grafted onto the 
spanning tree. 

e Node C joins the spanning tree next by forwarding its tree-join message directly to E. 

e Finally, because the unicast routing from G to E must be via node D, when G sends its tree- 


join message to E, the GD link is grafted onto the spanning tree at node D. 


Multicast 


A number of emerging network applications require the delivery of packets from one or more 


senders to a group of receivers. These applications include bulk data transfer, streaming 
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continuous media, shared data applications, data feeds, Web cache updating, and interactive 
gaming. 

In the case of unicast communication, the IP address of the receiver (destination) is carried in 
each IP unicast datagram and identifies the single recipient; in the case of broadcast, all nodes 
need to receive the broadcast packet, so no destination addresses are needed. But in the case of 
multicast, we now have multiple receivers. Does it make sense for each multicast packet to carry 
the IP addresses of all of the multiple recipients? While this approach might be workable with a 
small number of recipients, it would not scale well to the case of hundreds or thousands of 
receivers; the amount of addressing information in the datagram would swamp the amount of 
data actually carried in the packet’s payload field. Explicit identification of the receivers by the 
sender also requires that the sender know the identities and addresses of all of the receivers. 


Internet Group Management Protocol 


e¢ The IGMP protocol version 3 operates between a host and its directly attached router. 

¢ Below Figure shows three first-hop multicast routers, each connected to its attached hosts via 
one outgoing local interface. This local interface is attached to a LAN in this example, and 
while each LAN has multiple attached hosts, at most a few of these hosts will typically 


belong to a given multicast group at any given time. 


IGMP 


\GMP IGMP 
Wide-area 


multicast -“@@es— 
routing 


IGMP 
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IGMP provides the means for a host to inform its attached router that an application running 
on the host wants to join a specific multicast group. 

Given that the scope of IGMP interaction is limited to a host and its attached router, another 
protocol is clearly required to coordinate the multicast routers throughout the Internet, so that 
multicast datagrams are routed to their final destinations. This latter functionality is 
accomplished by network-layer multicast routing algorithms. 

IGMP has only three message types. Like ICMP, IGMP messages are carried (encapsulated) 
within an IP datagram, with an IP protocol number of 2. 

The membership_query message is sent by a router to all hosts on an attached interface to 
determine the set of all multicast groups that have been joined by the hosts on that interface. 
Hosts respond to a membership_query message with an IGMP membership_report 
message. 

membership_report messages can also be generated by a host when an application first 
joins a multicast group without waiting for a membership_query message from the router. 
The final type of IGMP message is the leave_group message. Interestingly, this message is 
optional. 

In a softstate protocol, the state is removed via a timeout if it is not explicitly refreshed (in 


this case, by a membership_report message from an attached host). 


Multicast Routing Algorithms 


In practice, two approaches have been adopted for determining the multicast routing tree. 


Multicast routing using a group-shared tree: i 





based on building a tree that includes all 
edge routers with attached hosts belonging to the multicast group. In practice, a center-based 
approach is used to construct the multicast routing tree, with edge routers with attached hosts 
belonging to the multicast group sending (via unicast) join messages addressed to the center 
node. As in the broadcast case, a join message is forwarded using unicast routing toward the 
center until it either arrives at a router that already belongs to the multicast tree or arrives at 
the center. All routers along the path that the join message follows will then forward received 
multicast packets to the edge router that initiated the multicast join. 

Multicast routing using a source-based tree: While group-shared tree multicast routing 


constructs a single, shared routing tree to route packets from all senders, the second approach 
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constructs a multicast routing tree for each source in the multicast group. In practice, an RPF 
algorithm (with source node x) is used to construct a multicast forwarding tree for multicast 
datagram’s originating at source x. The RPF broadcast algorithm we studied earlier requires a 
bit of tweaking for use in multicast. To see why, consider router D in below Figure Under 
broadcast RPF, it would forward packets to router G, even though router G has no attached 
hosts that are joined to the multicast group. While this is not so bad for this case where D has 
only a single downstream router, G, imagine what would happen if there were thousands of 
routers downstream from D! Each of these thousands of routers would receive unwanted 


multicast packets. 


S$: source 





Key: 
—> kt will be forwarded 
—=bil pkt not forwarded beyond receiving router 


The solution to the problem of receiving unwanted multicast packets under RPF is known as 
pruning. A multicast router that receives multicast packets and has no attached hosts joined to 
that group will send a prune message to its upstream router. If a router receives prune messages 
from each of its downstream routers, then it can forward a prune message upstream. 


Multicast Routing in the Internet 
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e The first multicast routing protocol used in the Internet was the Distance-Vector Multicast 
Routing Protocol (DVMRP). DVMRP implements source-based trees with reverse path 
forwarding and pruning. DVMRP uses an RPF algorithm with pruning. 

e Perhaps the most widely used Internet multicast routing protocol is the Protocol-Independent 
Multicast (PIM) routing protocol, which explicitly recognizes two multicast distribution 
scenarios. In dense mode, multicast group members are densely located; that is, many or 
most of the routers in the area need to be involved in routing multicast datagrams. PIM dense 
mode is a flood-and-prune reverse path forwarding technique similar in spirit to DVMRP. In 
sparse mode, the number of routers with attached group members is small with respect to the 
total number of routers; group members are widely dispersed. PIM sparse mode uses 
rendezvous points to set up the multicast distribution tree. 

e In source-specific multicast, only a single sender is allowed to send traffic into the multicast 


tree, considerably simplifying tree construction and maintenance. 
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Module —5 
NETWORK LAYER 
5.1 IPv4 ADDRESSES 


An IPv4 address is a 32-bit address that uniquely and universally defines the connection of a 


device (for example, a computer or a router) to the Internet. 


Address Space 

e An address space is the total number of addresses used by the protocol. If a protocol uses N 
bits to define an address, the address space is 2" because each bit can have two different 
values (0 or 1) and N bits can have 2" values. 

© IPv4 uses 32-bit addresses, which means that the address space is 2** or 4,294,967,296 (more 


than 4 billion). 


Notations 
There are two prevalent notations to show an IPy4 address: binary notation and dotted decimal 
notation. 
1) Binary Notation: 
In binary notation, the IPv4 address is displayed as 32 bits. Each octet is often referred to as a 
byte. The following is an example of an IPv4 address in binary notation: 

01110101 10010101 00011101 00000010 
2) Dotted-Decimal Notation: 
To make the IPv4 address more compact and easier to read, Internet addresses are usually written 
in decimal form with a decimal point (dot) separating the bytes. The following is the dotted 
decimal notation of the above address: 

117.149.29.2 

Example: 
Change the following IPv4 addresses from binary notation to dotted-decimal notation. 
a. 10000001 00001011 00001011 11101111 
b. 11000001 10000011 00011011 11111111 





Abhijith H V, Assistant Professor, Dept. of ISE, SVIT, Bangalore Page 1 


Data Communication Module 5: Network Layer 





Solution: 
a. 129.11.11.239 
b. 193.131.27.255 


Classful Addressing 

IPv4 addressing, at its inception, used the concept of classes. This architecture is called classful 
addressing. 

In classful addressing, the address space is divided into five classes: A, B, C, D, and E. Each class 


occupies some part of the address space. 





First Second Third Fourth First Second — Third Fourth 
byte byte byte byte byte byte byte byte 
Class A 








Class A 


| [e727 ] 
classe [70 [| Class 8 [128-191] 
fo J] [192223] 























































































































Class C I) Class C = 
Class D }1110 Class D |224-239 
Class E |1111 Class E }240-255 

a. Binary notation b, Dotted-decimal notation 

Classes and Blocks 


One problem with classful addressing is that each class is divided into a fixed number of blocks 


with each block having a fixed size. 




















Class Number of Blocks Block Size Application 
A 128 16,777,216 Unicast 
B 16,384 65,536 Unicast 
Cc 2,097,152 256 Unicast 
D iL 268,435,456 Multicast 
iE il 268,435,456 Reserved 




















e Class A addresses were designed for large organizations with a large number of attached 


hosts or routers. 
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e Class B addresses were designed for midsize organizations with tens of thousands of attached 
hosts or routers. 

e Class C addresses were designed for small organizations with a small number of attached 
hosts or routers. 

e Class D addresses were designed for multicasting. 

© Class E addresses were reserved for future use. 


¢ Inclassfnl addressing, a large part of the available addresses were wasted. 


Netid and Hostid 

e In classful addressing, an IP address in class A, B, or C is divided into netid and hostid. 
These parts are of varying lengths, depending on the class of the address. 

e  Inclass A, one byte defines the netid and three bytes define the hostid. 

¢ Inclass B, two bytes define the netid and two bytes define the hostid. 

e  Inclass C, three bytes define the netid and one byte defines the hostid. 


Mask 
Although the length of the netid and hostid (in bits) is predetermined in classful addressing, we 
can also use a mask (also called the default mask), a 32-bit number made of contiguous 1s 


followed by contiguous as. 




















Class Binary Dotted-Decimal CIDR 
A TLILL1L1 00000000 00000000 00000000 1 
B 1111111 11111111 00000000 00000000 ‘16 
Cc VEVT1001 11011111 11111111 00000000 124 

















The mask can help us to find the netid and the hostid. For example, the mask for a class A 
address has eight 1s, which means the first 8 bits of any address in class A define the netid; the 


next 24 bits define the hostid. 


Subnetting: If an organization was granted a large block in class A or B, it could divide the 
addresses into several contiguous groups and assign each group to smaller networks (called 


subnets) or, in rare cases, share part of the addresses with neighbors. 
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Supernetting: In supernetting, an organization can combine several class C blocks to create a 
larger range of addresses. In other words, several networks are combined to create a 


supernetwork or a supernet. 


Address Depletion 


The flaws in classful addressing scheme combined with the fast growth of the Internet led to the 
near depletion of the available addresses. We have run out of class A and B addresses, and a 
class C block is too small for most midsize organizations. One solution that has alleviated the 


problem is the idea of classless addressing. 


Classless Addressing 
To overcome address depletion and give more organizations access to the Internet, classless 
addressing was designed and implemented. In this scheme, there are no classes, but the addresses 


are still granted in blocks. 


Address Blocks 

In classless addressing, when an entity, small or large, needs to be connected to the Internet, it is 
granted a block (range) of addresses. The size of the block (the number of addresses) varies 
based on the nature and size of the entity. To simplify the handling of addresses, the Internet 
authorities impose three restrictions on classless address blocks: 

1. The addresses in a block must be contiguous, one after another. 

2. The number of addresses in a block must be a power of 2 (1, 2, 4, 8, ... ). 


3. The first address must be evenly divisible by the number of addresses. 


Mask 

e In classless addressing the mask for a block can take any value from 0 to 32. It is very 
convenient to give just the value of n preceded by a slash (CIDR notation). 

e In 1Pv4 addressing, a block of addresses can be defined as x.y.z.t/n in which x.y.z.t defines 
one of the addresses and the /n defines the mask. 


e The first address in the block can be found by setting the rightmost 32 - n bits to Os. 
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¢ The last address in the block can be found by setting the rightmost 32 - 17 bits to 1s. 


© The number of addresses in the block can be found by using the formula 2*7- n 


Network Addresses 

When an organization is given a block of addresses, the organization is free to allocate the 
addresses to the devices that need to be connected to the Internet. The first address in the class, 
however, is normally (not always) treated as a special address. The first address is called the 
network address and defines the organization network. It defines the organization itself to the 


rest of the world. 


Block Block 


First 205.16.37.32 
205.16.37.33 


16 Addresses 





Last 205.16.37.47 





a. Decimal b. Binary 

Hierarchy 

Two-Level Hierarchy: No Subnetting 

e An IP address can define only two levels of hierarchy when not subnetted. 

e The n leftmost bits of the address x.y.z.t/n define the network (organization network); the 32 
—n rightmost bits define the particular host (computer or router) to the network. 

e The part of the address that defines the network is called the prefix; the part that defines the 


host is called the suffix. 


28 





Network prefix | 





Host address 


, rr ora 
Y 


a 
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Three-Levels of Hierarchy: Subnetting 

e An organization that is granted a large block of addresses may want to create clusters of 
networks (called subnets) and divide the addresses between the different subnets. The rest of 
the world still sees the organization as one entity; however, internally there are several 
subnets. 

e All messages are sent to the router address that connects the organization to the rest of the 
Internet; the router routes the message to the appropriate subnets. 

e The organization, however, needs to create small sub blocks of addresses, each assigned to 


specific subnets. The organization has its own mask; each subnet must also have its own. 


Example: 

Suppose an organization is given the block 17.12.40.0/26, which contains 64 addresses. The 

organization has three offices and needs to divide the addresses into three sub blocks of 32, 16, 

and 16 addresses. We can find the new masks by using the following arguments: 

1. Suppose the mask for the first subnet is nl, then 2°?-n1 must be 32, which means that nl =27. 

2. Suppose the mask for the second subnet is n2, then 2°°-n2 must be 16, which means that 
n2=28. 


3. Suppose the mask for the third subnet is n3, then 2°°n3 must be 16, which means that n3=28. 


Subnet 3 







17.12.14.50/28 17.12.14.49/28 


q 17,12.14,33/28 


Network: 17.12:140726 


To the rest of 
the Internet 
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a. In subnet 1, the address 17.12.14.29/27 can give us the subnet address if we use the mask /27 
because 

Host: 00010001 00001100 00001110 00011101 

Mask: /27 

Subnet: 00010001 00001100 00001110 00000000 .... (17.12.14.0) 
b. In subnet 2, the address 17.12.14.45/28 can give us the subnet address if we use the mask /28 
because 

Host: 00010001 00001100 00001110 00101101 

Mask: /28 

Subnet: 00010001 00001100 00001110 00100000 .... (17.12.14.32) 
c. In subnet 3, the address 17.12.14.50/28 can give us the subnet address if we use the mask /28 
because 

Host: 00010001 00001100 00001110 00110010 

Mask: /28 


Subnet: 00010001 00001100 00001110 00110000 .... (17.12.14.48) 
Subnet 1 Subnets 2 and 3 


Network prefix 1 
jeer prem s 


Subnet prefix 

















1 
Network prefix 1 
1 — ERE 


Subnet prefix 


: renee 








Host address 








1 
1 

1 

1 

1 

1 

1 

Host address 1 
1 


bth taal 


qaeges 


More Levels of Hierarchy 
The structure of classless addressing does not restrict the number of hierarchical levels. An 
organization can divide the granted block of addresses into sub blocks. Each sub block can in 


turn be divided into smaller sub blocks. 


Network Address Translation (NAT) 

e The number of home users and small businesses that want to use the Internet is ever 
increasing. 

e In the beginning, a user was connected to the Internet with a dial-up line. An ISP with a 
block of addresses could dynamically assign an address to this user. An address was given to 


auser when it was needed. 
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e But the situation is different today. Home users and small businesses can be connected by an 
ADSL line or cable modem. In addition, many are not happy with one address; many have 
created small networks with several hosts and need an IP address for each host. With the 
shortage of addresses, this is a serious problem. 

e A quick solution to this problem is called network address translation (NAT). 

e NAT enables a user to have a large set of addresses internally and one address, or a small set 
of addresses, externally. The traffic inside can use the large set; the traffic outside, the small 
set. 

e To separate the addresses used inside the home or business and the ones used for the Internet, 


the Internet authorities have reserved three sets of addresses as private addresses. 














Range Total 
10.0.0.0 to 10.255.255.255 24 
172.16.0.0 to 172.31-255.255 270 
192.168.0.0 to 192.168.255.255 gle 














e Any organization can use an address out of this set without permission from the Internet 
authorities. Everyone knows that these reserved addresses are for private networks. They are 
unique inside the organization, but they are not unique globally. No router will forward a 
packet that has one of these addresses as the destination address. 

e The site must have only one single connection to the global Internet through a router that 


runs the NAT software. 


Site using private addresses 


NAT router 


200.24.5.8 





In the above figure, the router that connects the network to the global address uses one private 
address and one global address. The private network is transparent to the rest of the Internet; the 


rest of the Internet sees only the NAT router with the address 200.24.5.8. 
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Address Translation 

e All the outgoing packets go through the NAT router, which replaces the source address in 
the packet with the global NAT address. 

e All incoming packets also pass through the NAT router, which replaces the destination 


address in the packet (the NAT router global address) with the appropriate private address. 


Source: 200.24.5.8 


Destination: 200.24.5.8 











e A translation table has only two columns: the private’ address and the external address 
(destination address of the packet). When the router translates the source address of the 
outgoing packet, it also makes note of the destination address-where the packet is going. 
When the response comes back from the destination, the router uses the source address of the 


packet (as the external address) to find the private address of the packet. 


Destination: 25.8.2.10 
Source: 200.24.5.8 











Translation table 
Private External 


= 172.18.3.1 25.8.2.10 ~< 


























Destination: 200.24.5.8 


Source: 25.8.2.10 : 
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e Using a Pool of IP Addresses Since the NAT router has only one global address, only one 
private network host can access the same external host. To remove this restriction, the NAT 
router uses a pool of global addresses. 

e Using Both IP Addresses and Port Numbers To allow a many-to-many relationship between 
private-network hosts and external server programs, we need more information in the 


translation table. 



































Private Private | External | External Transport 
Address Port Address Port Protocol 
172.18.3.1 1400 25.8.3.2 80 TCP 
172.18.3.2 1401 25.8.3.2 80 TCP 
NAT and ISP 


An ISP that serves dial-up customers can use NAT technology to conserve addresses. For 
example, suppose an ISP is granted 1000 addresses, but has 100,000 customers. Each of the 
customers is assigned a private network address. The ISP translates each of the 100,000 source 
addresses in outgoing packets to one of the 1000 global addresses; it translates the global 
destination address in incoming packets to the corresponding private address. 


172.18.3.1 





1000 
addresses 
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5.2 IPv6 Addresses 
An IPvé6 address consists of 16 bytes (octets); it is 128 bits long. 


Hexadecimal Colon Notation 

To make addresses more readable, IPv6 specifies hexadecimal colon notation. In this notation, 
128 bits is divided into eight sections, each 2 bytes in length. Two bytes in hexadecimal notation 
requires four hexadecimal digits. Therefore, the address consists of 32 hexadecimal digits, with 


every four digits separated by a colon. 
128 bits = 16 bytes = 32 hex digits 











1111110111101100 ora 1999999919991111 


U 
[se : [Ra = [Ree : [ee : [ee : [ee : ee] : ee) 


Abbreviation 








Although the IP address, even in hexadecimal format, is very long, many of the digits are Zeros. 
In this case, we can abbreviate the address. 


Original 


Abbreviated 








More abbreviated | FDEC £74 5,5 BOFF 3 0 3 FFFO 


Tees 











Address Space 
IPv6 has a much larger address space; 2'?* addresses are available. The designers of IPv6 divided 


the address into several categories. A few leftmost bits, called the type prefix, in each address 
define its category. The type prefix is variable in length, but it is designed such that no code is 


identical to the first part of any other code. 
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Type Prefix Type Fraction 
0000 0000 Reserved 1/256 
0000 0001 Unassigned 1/256 
0000 001 ISO network addresses 1/128 
0000 010 IPX (Novell) network addresses 1/128 
0000 O11 Unassigned 1/128 
0000 1 Unassigned 1/32 
0001 Reserved 1/16 
001 Reserved 1/8 
010 Provider-based unicast addresses 1/8 

Type Prefix Type Fraction 
Oll Unassigned V/s 
100 Geographic-based unicast addresses 1/8 
101 Unassigned Vs 
110 Unassigned 1/8 
1110 Unassigned 1/16 
11110 Unassigned 1/32 
1111 10 Unassigned 1/64 
1111 110 Unassigned 1/128 
11111100 Unassigned 1/512 
1111 111010 | Link local addresses 1/1024 
1111 111011 | Site local addresses 1/1024 
L111 1111 Multicast addresses 1/256 

Unicast Addresses 


A unicast address defines a single computer. The packet sent to a unicast address must be 


delivered to that specific computer. IPv6 defines two types of unicast addresses: geographically 


based and provider-based. 
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Subnet prefix 











Subscriber prefix 





Provider prefix 








Subscriber | Node 
identifier z = identifier 


Registry 


Fields for the provider-based address are as follows: 

e Type identifier. This 3-bit field defines the address as a provider-based address. 

e Registry identifier. This 5-bit field indicates the agency that has registered the address. 
Currently three registry centers have been defined. INTERNIC (code 11000) is the center for 
North America; RIPNIC (code 01000) is the center for European registration; and APNIC 
(code 10100) is for Asian and Pacific countries. 

e Provider identifier. This variable-length field identifies the provider for Internet access 
(such as an ISP). 

e Subscriber identifier. When an organization subscribes to the Internet through a provider, it 
is assigned a subscriber identification. 

e Subnet identifier. Each subscriber can have many different subnetworks, and each 
subnetwork can have an identifier. The subnet identifier defines a specific subnetwork under 
the territory of the subscriber. 


e Node identifier. The last field defines the identity of the node connected to a subnet. 


Multicast Addresses 
Multicast addresses are used to define a group of hosts instead of just one. A packet sent to a 


multicast address must be delivered to each member of the group. 
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112 bits 











Group ID 





e The second field is a flag that defines the group address as either permanent or transient. A 


permanent group address is defined by the Internet authorities and can be accessed at all 
times. A transient group address, on the other hand, is used only temporarily. 


e The third field defines the scope of the group address. 


Anycast Addresses 
An anycast address, like a multicast address, also defines a group of nodes. However, a packet 
destined for an anycast address is delivered to only one of the members of the anycast group, the 


nearest one (the one with the shortest route). 


Reserved Addresses 


Another category in the address space is the reserved address. These addresses start with eight Os 
8 bits 120 bits 





[socoooo] Allos | a. Unspecified 


8 bits 120 bits 





b, Loopback 





B bits 88 bits 32 bits 
All Os : 








8 bits 72 bits 16 bits 32 bits 





All Os 





e An unspecified address is used when a host does not know its own address and sends an 
inquiry to find its address. 


e A loopback address is used by a host to test itself without going into the network. 
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« A compatible address is used during the transition from IPv4 to IPv6. It is used when a 
computer using IPv6 wants to send a message to another computer using IPv6, but the 


message needs to pass through a part of the network that still operates in IPv4. 


A mapped address is also used during transition. However, it is used when a computer that 


has migrated to IPv6 wants to send a packet to a computer still using IPv4. 


Local Addresses 
These addresses are used when an organization wants to use IPv6 protocol without being 


connected to the global Internet. 





10 bits 70 bits 48 bits 
[med All Os [ neseacces | a. Link local 
10 bits 38 bits 32 bits 48 bits 





pron All Os Ez b. Site local 


5.3 Internetworking 


The physical and data link layers of a network operate locally. These two layers are jointly 


responsible for data delivery on the network from one node to the next. 


A s1 $3 D 
Data link Data link 
Physical Physical 
Hop-to-hop Hop-to-hop Hop-to-hop 
delivery delivery delivery 
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This internetwork is made of five networks: four LANs and one WAN. 


If host A needs to send a data packet to host D, the packet needs to go first from A to RI (a 


switch or router), then from RI to R3, and finally from R3 to host D. 


In each link, two physical and two data link layers are involved. However, there is a big problem 
here. When data arrive at interface fl of RI, There is no provision in the data link (or physical) 
layer to help RI make the right decision. The frame does not carry any routing information either. 
The frame contains the MAC address of A as the source and the MAC address of RI as the 


destination. 
Need for Network Layer 


To solve the problem of delivery through several links, the network layer (or the internetwork 
layer, as it is sometimes called) was designed. 
The network layer is responsible for host-to-host delivery and for routing the packets through the 


routers or switches. 


A SI $3 D 
Network Network 
Data link Data link 
Physical Physical 








Host-to-host path 


The network layer at the source is responsible for creating a packet from the data coming from 


another protocol (such as a transport layer protocol or a routing protocol). The header of the 
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packet contains, among other information, the logical addresses of the source and destination. 
The network layer is responsible for checking its routing table to find the routing information 
(such as the outgoing interface of the packet or the physical address of the next node). If the 


packet is too large, the packet is fragmented. 


The network layer at the switch or router is responsible for routing the packet. When a packet 
arrives, the router or switch consults its routing table and finds the interface from which the 
packet must be sent. The packet, after some changes in the header, with the routing infonnation 


is passed to the data link layer again. 


The network layer at the destination is responsible for address verification; it makes sure that the 
destination address on the packet is the same as the address of the host. If the packet is a 
fragment, the network layer waits until all fragments have arrived, and then reassembles them 


and delivers the reassembled packet to the transport layer. 





Data from Data to 
another protocol another protocol 
Source Destination 







































IP packet 
and routing 
iSzeers 4 
Network layer Network layer 
To data From data 
link layer link layer 
a. Network layer at source b. Network layer at destination 
Router 
= Routing 


table 
























Ip IP packet 
packet and routing | 
information | 
Network layer 
From data To data 
link layer link layer 





c. Network layer at a router 
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5.4 IPv4 


The Internet Protocol version 4 (IPv4) is the delivery mechanism used by the TCP/IP protocols. 
IPv4 is an unreliable and connectionless datagram protocol-a best-effort delivery service. The 
term best-effort means that IPv4 provides no error control or flow control. IPv4 assumes the 
unreliability of the underlying layers and does its best to get a transmission through to its 
destination, but with no guarantees. If reliability is important, IPv4 must be paired with a reliable 


protocol such as TCP. 
Datagram 


Packets in the IPv4 layer are called datagrams. A datagram is a variable-length packet consisting 
of two parts: header and data. The header is 20 to 60 bytes in length and contains information 


essential to routing and delivery. 


20-65,536 bytes 


20-60 bytes 































VER HLEN Service Total length 
4 bits 4 bits 8 bits 16 bits 
Identification| Flags Fragmentation offset 
16 bits 3 bits 13 bits 
Time to live Protocol Header checksum 
8 bits 8 bits 16 bits 











Source IP address 





Destination IP address 











32 bits 


e Version: Version no. of Internet Protocol used (e.g. IPv4). 

« HLEN: This 4-bit field defines the total length of the datagram header in 4-byte words. This 
field is needed because the length of the header is variable (between 20 and 60 bytes). When 
there are no options, the header length is 20 bytes, and the value of this field is 5 (5 x 4 = 20). 
When the option field is at its maximum size, the value of this field is 15 (15 x 4 = 60). 


¢ Service: This field, previously called service type, is now called differentiated services. 
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D: Minimize delay R: Maximize reliability 
T:Maximize throughput —C: Minimize cost 


Glog | nee 


Codepoint 

















Precedence TOS bits 





Service type Differentiated services 


Service Type: 
In this interpretation, the first 3 bits are called precedence bits ranging from 0 (000 in binary) to 7 
(111 in binary). The precedence defines the priority of the datagram in issues such as congestion.. The 


next 4 bits are called type of service (TOS) bits, and the last bit is not used. 





TOS Bits Description 
0000 Normal (default) 
0001 Minimize cost 
0010 Maximize reliability 














0100 Maximize throughput 
1000 Minimize delay 

















Application programs can request a specific type of service. The defaults for some applications are: 



























































Protocol TOS Bits Description 
ICMP 0000 Normal 
BOOTP. 0000 Normal 
NNTP 0001 Minimize cost 
IGP OOLO Maximize reliability 
SNMP OOLO Maximize reliability 
TELNET 1000 Minimize delay 
FTP (data) 0100 Maximize throughput 
FTP (control) 1000 Minimize delay 
TFTP 1000 Minimize delay 
SMTP (command) 1000 Minimize delay 
SMTP (data) 0100 Maximize throughput 
DNS (UDP query) 1000 Minimize delay 
DNS (TCP query) | 0000 | Normal 
DNS (zone) 0100 Maximize throughput 
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Differentiated Services 

In this interpretation, the first 6 bits make up the codepoint subfield, and the last 2 bits are not 
used. The codepoint subfield can be used in two different ways. 

a. When the 3 rightmost bits are 0s, the 3 leftmost bits are interpreted the same as the 
precedence bits in the service type interpretation. 

b. When the 3 rightmost bits are not all Os, the 6 bits define 64 services based on the priority 


assignment by the Internet or local authorities. 


¢ Total Length: Length of entire IP Packet (including IP header and IP Payload). 
Length of data =total length - header length 

¢ Identification: If IP packet is fragmented during the transmission, all the fragments contain 
same identification number to identify original IP packet they belong to. 

«Flags: As required by the network resources, if IP Packet is too large to handle, these ‘flags’ 
tells if they can be fragmented or not. In this 3-bit flag, the MSB is always set to ‘0’. 

« Fragment Offset: This offset tells the exact position of the fragment in the original IP 
Packet. 

¢ Time to Live: To avoid looping in the network, every packet is sent with some TTL value 
set, which tells the network how many routers (hops) this packet can cross. At each hop, its 
value is decremented by one and when the value reaches zero, the packet is discarded. 

¢ Protocol: Tells the Network layer at the destination host, to which Protocol this packet 


belongs to, i.e. the next level Protocol. 


1 ICMP 
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« Header Checksum: This field is used to keep checksum value of entire header which is then 


used to check if the packet is received error-free. 


























Example: 
4 5 0 | 28 
i 0 () 
4 
10.12.14.5 
12.6.7.9 
4,5,and0 —*»> 4 5 
2 
1— 


OandQ ——> 
4and 17, ——> 
o — 

10.12. —— 
14.5 — 
12.6 —~ 

79 —>r 


Sum ——> 
Checksum = ——> 





a®NJOo0CCCCOONO 
ma/NOmroRoCcO 
mplOooooco0-00=-0 
am|lonuno=o-n0 


« Source Address: 32-bit IPv4 address of the Sender (or source) of the packet. 


¢ Destination Address: 32-bit IPv4 address of the Receiver (or destination) of the packet. 


Fragmentation 

A datagram can travel through different networks. Each router decapsulates the IPv4 datagram 
from the frame it receives, processes it, and then encapsulates it in another frame. The format 
and size of the received frame depend on the protocol used by the physical network through 
which the frame has just traveled. The format and size of the sent frame depend on the protocol 


used by the physical network through which the frame is going to travel. 


Maximum Transfer Unit (MTU) 
Each data link layer protocol has its own frame format. One of the fields defined in the format is 
the maximum size of the data field. The value of the MTU depends on the physical network 


protocol. 
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IP datagram 
[__Hesser | MTU [_taier | 
Maximum length of data to be encapsulated in a frame 
Frame 
Protocol MTU 
Hyperchannel 65,535 
Token Ring (16 Mbps) 17,914 
Token Ring (4 Mbps) 4,464 
FDDI 4,352 
Ethernet 1,500 
X.25 576 
PPP 296 





To make the IPv4 protocol independent of the physical network, the designers decided to make 
the maximum length of the IPv4 datagram equal to 65,535 bytes. 

When a datagram is fragmented, each fragment has its own header with most of the fields 
repeated, but with some changed. A fragmented datagram may itself be fragmented if it 
encounters a network with an even smaller MTU. In other words, a datagram can be fragmented 


several times before it reaches the final destination. 


Fields Related to Fragmentation 

e Identification. This 16-bit field identifies a datagram originating from the source host. The 
combination of the identification and source IPv4 address must uniquely define a datagram 
as it leaves the source host. To guarantee uniqueness, the IPv4 protocol uses a counter to 
label the datagrams. The counter is initialized to a positive number. When the IPv4 protocol 
sends a datagram, it copies the current value of the counter to the identification field and 
increments the counter by 1. 

e Flags. This is a 3-bit field. The first bit is reserved. The second bit is called the do not 


fragment bit. If its value is 1, the machine must not fragment the datagram. If its value is 0, 
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the datagram can be fragmented if necessary. The third bit is called the more fragment bit. If 
its value is 1, it means the datagram is not the last fragment; there are more fragments after 


this one. If its value is 0, it means this is the last or only fragment 


| > | m | D: Do not fragment 
M: More fragments 
¢ Fragmentation offset. This 13-bit field shows the relative position of this fragment with 


respect to the whole datagram. It is the offset of the data in the original datagram measured in 


units of 8 bytes. 





Offset = 0000/8 = 0 






















Offset = 0000/8 =0 a Mead 




















Offset = 1400/8 = 175 


i 2799 


Byte 0000 Byte 3999 





Offset = 2800/8 = 350 


2800 3999 
1420 






































820 


14,567 17s | 




















Fragment 1 


14,567 1 


Fragment 2.1 


620 
© Bre i0T is _— 









































































































































Fragment 2 
Original datagram 
14,567 lo] 350, | Fragment 2.2 
Fragment 3 
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Options: 
This is optional field, which is used if the value of HEL is greater than 5. These options may 


contain values for options such as Security, Record Route, Time Stamp, etc. 






Single-byte 


Multiple-byte 


> No Operation 
A no-operation option is a 1-byte option used as a filler between options. 

> End of Option 
An end-of-option option is a 1-byte option used for padding at the end of the option field. It, 
however, can only be used as the last option. 

> Record Route 
A record route option is used to record the Internet routers that handle the datagram. It can 
list up to nine router addresses. It can be used for debugging and management purposes. 

>» Strict Source Route 
A strict source route option is used by the source to predetermine a route for the datagram as 
it travels through the Internet. Dictation of a route by the source can be useful for several 
purposes. The sender can choose a route with a specific type of service, such as minimum 
delay or maximum throughput. Alternatively, it may choose a route that is safer or more 
reliable for the sender's purpose. 

> Loose Source Route 
A loose source route option is similar to the strict source route, but it is less rigid. Each router 


in the list must be visited, but the datagram can visit other routers as well. 
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> Timestamp 
A timestamp option is used to record the time of datagram processing by a router. The time is 
expressed in milliseconds from midnight, Universal time or Greenwich mean time. Knowing 
the time a datagram is processed can help users and managers track the behavior of the 


routers in the Internet. 


5.5 IPv6 


Advantages 

e Larger address space. An IPv6 address is 128 bits long. Compared with the 32-bit address 
of IPv4, this is a huge increase in the address space. 

¢ Better header format. IPv6 uses a new header format in which options are separated from 
the base header and inserted, when needed, between the base header and the upper-layer data. 
This simplifies and speeds up the routing process because most of the options do not need to 
be checked by routers. 

e New options. IPv6 has new options to allow for additional functionalities. 

e Allowance for extension. IPv6 is designed to allow the extension of the protocol if required 
by new technologies or applications. 

e Support for resource allocation. In IPv6, the type-of-service field has been removed, but a 
mechanism has been added to enable the source to request special handling of the packet. 
This mechanism can be used to support traffic such as real-time audio and video. 

¢ Support for more security. The encryption and authentication options in IPv6 provide 


confidentiality and integrity of the packet. 


Packet Format 

Each packet is composed of a mandatory base header followed by the payload. The payload 
consists of two parts: optional extension headers and data from an upper layer. The base header 
occupies 40 bytes, whereas the extension headers and data from the upper layer contain up to 


65,535 bytes of information. 
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|. 40 bytes | Up to 65,535 bytes | 













Payload 


Data packet from upper layer 


|_Abits | 4bits | sbits | 8bits | Bbits | 


















<mCVER PRI Flow label 
















Payload length 





Source address 





Destination address 





—- 
Header length 
— | 





Header length 





Header length 














© Version. This 4-bit field defines the version number of the IP. For IPv6, the value is 6. 

e Priority. The 4-bit priority field defines the priority of the packet with respect to traffic 
congestion. 

© Flow label. The flow label is a 3-byte (24-bit) field that is designed to provide special 


handling for a particular flow of data. 
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e Payload length. The 2-byte payload length field defines the length of the IP datagram 
excluding the base header. 

e Next header. The next header is an 8-bit field defining the header that follows the base 
header in the datagram. The next header is either one of the optional extension headers used 
by IP or the header of an encapsulated packet such as UDP or TCP. Each extension header 


also contains this field. 























Code Next Header 
0 Hop-by-hop option 
2 ICMP 
6 TCP 
17 UDP 
43 Source routing 
44 Fragmentation 





50 Encrypted security payload 





51 Authentication 





59 Null (no next header) 














60 Destination option 





e Hop limit. This 8-bit hop limit field serves the same purpose as the TIL field in IPv4. 

e Source address. The source address field is a 16-byte (128-bit) Internet address that 
identifies the original source of the datagram. 

e Destination address. The destination address field is a 16-byte (128-bit) Internet address 


that usually identifies the final destination of the datagram. 


Priority 

The priority field of the IPv6 packet defines the priority of each packet with respect to other 
packets from the same source. IPv6 divides traffic into two broad categories: congestion- 
controlled and noncongestion-controlled. 

Congestion-Controlled Traffic: 

e  Ifasource adapts itself to traffic slowdown when there is congestion, the traffic is referred to 


as congestion-controlled traffic. 
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e In congestion-controlled traffic, it is understood that packets may arrive delayed, lost, or out 
of order. Congestion-controlled data are assigned priorities from 0 to 7. A priority of 0 is the 


lowest; a priority of 7 is the highest. 
































Priority Meaning 
0 No specific traffic 
1 Background data 
2 Unattended data traffic 
3 Reserved 
4 Attended bulk data traffic 
5 Reserved 
6 Interactive traffic 
7 Control traffic 














e No specific traffic. A priority of 0 is assigned to a packet when the process does not define a 
priority. 

e Background data. This group (priority 1) defines data that are usually delivered in the 
background. Delivery of the news is a good example. 

e Unattended data traffic. If the user is not waiting (attending) for the data to be received, the 
packet will be given a priority of 2. E-mail belongs to this group. 

e Attended bulk data traffic. A protocol that transfers data while the user is waiting 
(attending) to receive the data (possibly with delay) is given a priority of 4. FTP and HTTP 
belong to this group. 

e Interactive traffic. Protocols such as TELNET that need user interaction are assigned the 
second-highest priority (6) in this group. 

e Control traffic. Control traffic is given the highest priority (7). Routing protocols such as 


OSPF and RIP and management protocols such as SNMP have this priority. 


Noncongestion-Controlled Traffic: This refers to a type of traffic that expects minimum delay. 
Discarding of packets is not desirable. Retransmission in most cases is impossible. Real-time 


audio and video are examples of this type of traffic. 
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Priority Meaning 
8 Data with greatest redundancy 
15 | Data with least redundancy 
Flow Label 


e A sequence of packets, sent from a particular source to a particular destination, that needs 
special handling by routers is called a flow of packets. The combination of the source address 
and the value of the flow label uniquely defines a flow of packets. 

e To a router, a flow is a sequence of packets that share the same characteristics, such as 
traveling the same path, using the same resources, having the same kind of security, and so 
on. 

e A router that supports the handling of flow labels has a flow label table. 

e The table has an entry for each active flow label; each entry defines the services required by 
the corresponding flow label. 

e When the router receives a packet, it consults its flow label table to find the corresponding 
entry for the flow label value defined in the packet. It then provides the packet with the 


services mentioned in the entry. 


Comparison Between IPv4 and IPv6 Headers 





Comparison 





1. The no-operation and end-of-option options in IPv4 are replaced by Pad1 and PadN 














options in [Pv6. 
2. The record route option is not implemented in IPv6 because it was not used. 
3. The timestamp option is not implemented because it was not used. 
4. The source route option is called the source route extension header in [Pv6. 
5. The fragmentation fields in the base header section of IPv4 have moved to the fragmentation 


extension header in IPv6, 





6. The authentication extension header is new in IPv6. 

















7. The encrypted security payload extension header is new in IPy6. 
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Extension Headers 
The length of the base header is fixed at 40 bytes. However, to give greater functionality to the 


IP datagram, the base header can be followed by up to six extension headers. 
Pad1 


PadN 
Jumbo payload 








«© Hop-by-Hop Option 
The hop-by-hop option is used when the source needs to pass information to all routers 
visited by the datagram. The Pad | option is 1 byte long and is designed for alignment 
purposes. Pad N is similar in concept to Pad 1. The jumbo payload option is used to define a 
payload longer than 65,535 bytes. 

e Source Routing 
The source routing extension header combines the concepts of the strict source route and the 
loose source route options of IPv4. 

e Fragmentation 
In IPv6, only the original source can fragment. A source must use a path MTU discovery 
technique to find the smallest MTU supported by any network on the path. The source then 
fragments using this knowledge. 

© Authentication 
The authentication extension header has a dual purpose: it validates the message sender and 
ensures the integrity of data. 

e Encrypted Security Payload 
The encrypted security payload (ESP) is an extension that provides confidentiality and 


guards against eavesdropping. 
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e Destination Option 
The destination option is used when the source needs to pass information to the destination 


only. Intermediate routers are not permitted access to this information. 


Comparison between IPv4 options and IPv6 extension headers 





Comparison 





. The no-operation and end-of-option options in IPv4 are replaced by Pad! and PadN 
options in IPv6. 





np 


. The record route option is not implemented in [Pv6 because it was not used. 





. The timestamp option is not implemented because it was not used. 





. The source route option is called the source route extension header in [Pv6. 





wl alo 


. The fragmentation fields in the base header section of IPv4 have moved to the fragmentation 
extension header in IPv6. 





6. The authentication extension header is new in IPv6. 


7. The encrypted security payload extension header is new in IPv6. 











5.6 TRANSITION FROM IPv4 TO IPv6 


Three strategies have been defined to help the transition. 









Transition 
strategies 





Dual Stack 


It is recommended that all hosts, before migrating completely to version 6, have a dual stack of 
protocols. In other words, a station must run IPy4 and IPv6 simultaneously until all the Internet 


uses IPv6. 
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Transport and 
application layers 














To IPv4 system To IPv6 system 


Tunneling 


Tunneling is a strategy used when two computers using IPv6 want to communicate with each 
other and the packet must pass through a region that uses IPv4. To pass through this region, the 
packet must have an IPv4 address. So the IPv6 packet is encapsulated in an IPv4 packet when it 


enters the region, and it leaves its capsule when it exits the region. 


IPv6 header ———————— IPv6 header 











Header Translation 


The sender wants to use IPv6, but the receiver does not understand IPv6. Tunneling does not 
work in this situation because the packet must be in the IPv4 format to be understood by the 
receiver. In this case, the header format must be totally changed through header translation. The 


header of the IPv6 packet is converted to an IPv4 header. 
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IPv6 header 


Header 
translation 
done here 








Header Translation Procedure 





1, The IPv6 mapped address is changed to an IPv4 address by extracting the rightmost 32 bits. 





nv 


. The value of the IPv6 priority field is discarded. 





we 


. The type of service field in IPv4 is set to zero. 





. The checksum for [Pv4 is calculated and inserted in the corresponding field. 





. The IPv6 flow label is ignored. 





alu] se 


. Compatible extension headers are converted to options and inserted in the IPv4 header. 
Some may have to be dropped. 





x 


. The length of IPv4 header is calculated and inserted into the corresponding field. 








8. The total length of the IPv4 packet is calculated and inserted in the corresponding field. 
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Module — 2 


TRANSPORT LAYER 


2.1 Introduction and Transport-Layer Services 


A transport-layer protocol provides for logical communication between application processes 
running on different hosts. 

Application processes use the logical communication provided by the transport layer to send 
messages to each other, free from the worry of the details of the physical infrastructure used 
to carry these messages. 

On the sending side, the transport layer converts the application-layer messages it receives 
from a sending application process into transport-layer packets, known as transport-layer 
segments. 

This is done by (possibly) breaking the application messages into smaller chunks and adding 
a transport-layer header to each chunk to create the transport-layer segment. 

The transport layer then passes the segment to the network layer at the sending end system, 
where the segment is encapsulated within a network-layer packet (a datagram) and sent to the 
destination. 

On the receiving side, the network layer extracts the transport-layer segment from the 
datagram and passes the segment up to the transport layer. 

The transport layer then processes the received segment, making the data in the segment 
available to the receiving application. 

Internet has two protocols—TCP and UDP. Each of these protocols provides a different set 


of transport-layer services to the invoking application. 


2.1.1 Relationship between Transport and Network Layers 


Transport Layer provides Process to process delivery service whereas network layer provides 
end to end delivery of data. 
Transport-layer protocol provides logical communication between processes running on 


different hosts, a network-layer protocol provides logical communication between hosts. 





Archana R A, Asso. Prof., Dept. of CSE, SVIT 1 


Module — 2: Transport Layer 





Within an end system, a transport protocol moves messages from application processes to the 
network edge (that is, the network layer) and vice versa, but it doesn’t have any say about 
how the messages are moved within the network core. 

The services that a transport protocol can provide are often constrained by the service model 
of the underlying network-layer protocol. If the network-layer protocol cannot provide delay 
or bandwidth guarantees for transport layer segments sent between hosts, then the transport- 
layer protocol cannot provide delay or bandwidth guarantees for application messages sent 


between processes. 


2.1.2 Overview of the Transport Layer in the Internet 


The Internet supports two transport layer protocols: 


1) UDP (User Datagram Protocol), which provides an unreliable, connectionless service to the 


invoking application. 


2) TCP (Transmission Control Protocol), which provides a reliable, connection-oriented service 


to the invoking application. 


The Internet’s network-layer protocol has Internet Protocol. IP provides logical 
communication between hosts. 

The IP service model is a best-effort delivery service. This means that IP makes its “best 
effort” to deliver segments between communicating hosts, but it makes no guarantees. In 
particular, it does not guarantee segment delivery, it does not guarantee orderly delivery of 
segments, and it does not guarantee the integrity of the data in the segments. 

The most fundamental responsibility of UDP and TCP is to extend IP’s delivery service 
between two end systems to a delivery service between two processes running on the end 


systems. Extending host-to-host delivery to proces 





-to-process delivery is called transport- 
layer multiplexing and demultiplexing. 

UDP and TCP also provide integrity checking by including error detection fields in their 
segments’ headers. 

UDP is an unreliable service it does not guarantee that data sent by one process will arrive 
intact to the destination process. 

TCP, on the other hand, offers several additional services to applications. First and foremost, 


it provides reliable data transfer. Using flow control, sequence numbers, acknowledgments, 
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and timers, TCP ensures that data is delivered from sending process to receiving process, 
correctly and in order. 

TCP thus converts IP’s unreliable service between end systems into a reliable data transport 
service between processes. 

TCP also provides congestion control. TCP congestion control prevents any one TCP 
connection from swamping the links and routers between communicating hosts with an 
excessive amount of traffic. 

UDP traffic, on the other hand, is unregulated. An application using UDP transport can send 


at any rate it pleases, for as long as it pleases. 


2.2 Multiplexing and Demultiplexing 


At the destination host, the transport layer receives segments from the network layer just 
below. 

The transport layer has the responsibility of delivering the data in these segments to the 
appropriate application process running in the host. 

A process can have one or more sockets, doors through which data passes from the network 


to the process and through which data pa 





‘s from the process to the network. 

The transport layer in the receiving host does not actually deliver data directly to a process, 
but instead to an intermediary socket. 

Because at any given time there can be more than one socket in the receiving host, each 
socket has a unique identifier. 

Each transport-layer segment has a set of fields in the segment to help receiver to deliver data 
to appropriate process socket. 

At the receiving end, the transport layer examines these fields to identify the receiving socket 
and then directs the segment to that socket. This job of delivering the data in a transport-layer 
segment to the correct socket is called demultiplexing. 

The job of gathering data chunks at the source host from different sockets, encapsulating 
each data chunk with header information to create segments, and passing the segments to the 


network layer is called multiplexing. 
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Application eB, > Application “P, 








Application 







Network 
Data link 





Network 
Data link 
Physical 


Network 
Data link 
Physical 



































Physical 














Key: 
Process socket 


e  Transport-layer multiplexing requires (1) that sockets have unique identifiers, and (2) that 
each segment have special fields that indicate the socket to which the segment is to be 
delivered. These special fields are the source port number field and the destination port 
number field. 

e Each port number is a 16-bit number, ranging from 0 to 65535. The port numbers ranging 
from 0 to 1023 are called well-known port numbers and are restricted, which means that they 
are reserved for use by well-known application protocols such as HTTP (which uses port 
number 80) and FTP (which uses port number 21). 

32 bits 


Source port # Dest. port # 
Other header fields 
Application 
data 
(message) 


e UDP performs connectionless multiplexing and demultiplexing. TCP performs connection 


oriented multiplexing and demultiplexing. 
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2.3 Connectionless Transport: UDP 


e UDP isa connectionless protocol which performs only multiplexing/demultiplexing function 
and some light error checking. 

e UDP takes messages from the application process, attaches source and destination port 
number fields for the multiplexing/demultiplexing service, adds two other small fields, and 
passes the resulting segment to the network layer. 

e The network layer encapsulates the transport-layer segment into an IP datagram and then 
makes a best-effort attempt to deliver the segment to the receiving host. 

e Ifthe segment arrives at the receiving host, UDP uses the destination port number to deliver 


the segment’s data to the correct application process. 


Many applications are better suited for UDP for the following reasons: 
1) Finer application-level control over what data is sent, and when: 


es data to UDP, UDP will package the 





e Under UDP, as soon as an application process 





pe 
data inside a UDP segment and immediately pass the segment to the network layer. 

e TCP, on the other hand, has a congestion-control mechanism that throttles the transport-layer 
TCP sender when one or more links between the source and destination hosts become 
excessively congested. TCP will also continue to resend a segment until the receipt of the 
segment has been acknowledged by the destination. 

e Since real-time applications often require a minimum sending rate, do not want to overly 
delay segment transmission, and can tolerate some data loss, TCP’s service model is not 
particularly well matched to these applications’ needs. 

2) No connection establishment: 

TCP uses a three-way handshake to establish the connection before it starts to transfer data. UDP 

just sends the data without any formal preliminaries. Thus UDP does not introduce any delay to 

establish a connection. 

3) No connection state: 

TCP maintains connection state in the end systems. This connection state includes receive and 

send buffers, congestion-control parameters, and sequence and acknowledgment number 


parameters. 
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UDP, on the other hand, does not maintain connection state and does not track any of these 
parameters. For this reason, a server devoted to a particular application can typically support 
many more active clients when the application runs over UDP rather than TCP. 

4) Small packet header overhead: 

The TCP segment has 20 bytes of header overhead in every segment, whereas UDP has only 8 


bytes of overhead. 






































‘Application Layer Underlying Transport 

Application Protocol Protocol 
Hectronic mail SMIP Ter 
Remote terminal access Telnet TCP 
Web HITP Top 
File transfer FIP TCP 
Remote file server NFS Typically UDP 
Streaming multimedia typically proprietary UDP or TCP 
Internet telephony typically proprietary UDP or TCP 
Network management SNMP Typically UDP 
Routing protocol RIP Typically UDP 
Name translation DNS Typically UDP 

Popular Internet applications and their underlying transport 

protocols 

2.3.1 UDP Segment Structure 
32 bits 


SO—— | 


Source port # Dest. port # 
Length Checksum 
Sppleion 
(message) 
e The UDP header has only four fields, each consisting of two bytes. 
e The port numbers allow the destination host to pass the application data to the correct process 
running on the destination end system 
e The length field specifies the number of bytes in the UDP segment (header plus data). 
e The checksum is used by the receiving host to check whether errors have been introduced 
into the segment. 


e The application data occupies the data field of the UDP segment. 
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2.3.2 UDP Checksum 


The checksum is used to determine whether bits within the UDP segment have been altered as it 
moved from source to destination. 

Step1: Add all the data elements using binary addition (Modulo-2 addition). If you get extra bit 
wrap it. 


0110011001100000 
0101010101010101 
1000111100001100 


The sum of first two of these 16-bit words is 


0110011001100000 
0101010101010101 
1OLNIOIIIONIOIOL 


Adding the third word to the above sum gives 


1O11101110110101 
1000111100001100 
0100101011000010 


Step 2: Take 1s complement of the result. 

The 1s complement is obtained by converting all the Os to 1s and converting all the Is to Os. 
Thus the 1s complement of the sum 0100101011000010 is 1011010100111101, which becomes 
the checksum. 

Step 3: Data along with checksum is transmitted to receiver. 

Step 4: at the receiver side add all the data and checksum using binary addition. Wrap the extra 
bit and take 1s complement of the result. This will be the checksum. If checksum is all 0’s 
receiver has received error free data otherwise it has received corrupted data. 

2.4 Principles of Reliable Data Transfer 

The service abstraction provided to the upper-layer entities is that of a reliable channel through 
which data can be transferred. With a reliable channel, no transferred data bits are corrupted 
(flipped from 0 to 1, or vice versa) or lost, and all are delivered in the order in which they were 
sent. 

It is the responsibility of a reliable data transfer protocol to implement this service abstraction. 
This task is made difficult by the fact that the layer below the reliable data transfer protocol may 


be unreliable. 
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a a a a 


Sending >) (Receiver 
Application \ process / (process 


layer 
































Les} 
pat _ooay | 
: 
Reliable data Reliable data 
eet itamsfer proincol transtor protec 
Gandinig side) Gecahn deh 
Reliable channel 2 EI 
Network 
layer —_—_———— 
Unreliable channel 
l T T 
2 Provided service b. Service implementation 
Key: 


@oate = Bracket 
e The sending side of the data transfer protocol will be invoked from above by a call to 
rdt_send(). It will pass the data to be delivered to the upper layer at the receiving side. 
e On the receiving side, rdt_rev() will be called when a packet arrives from the receiving side 
of the channel. 
e¢ When the rdt protocol wants to deliver data to the upper layer, it will do so by calling 
deliver_data(). 


e Both the send and receive sides of rdt send packets to the other side by a call to udt_send() 


2.4.1 Building a Reliable Data Transfer Protocol 
Reliable Data Transfer over a Perfectly Reliable Channel: rdt1.0 


e The sending side of rdt simply accepts data from the upper layer via the rdt_send(data) event, 
creates a packet containing the data (via the action make_pkt(data)) and sends the packet into 
the channel. 

e On the receiving side, rdt receives a packet from the underlying channel via the 
rdt_rev(packet) event, removes the data from the packet (via the action extract (packet, data)) 
and passes the data up to the upper layer (via the action deliver_data(data)). 

e Here all packet flow is from the sender to receiver; with a perfectly reliable channel there is 


no need for the receiver side to provide any feedback to the sender since nothing can go 
wrong. 
e Also we have assumed that the receiver is able to receive data as fast as the sender happens to 


send data. Thus, there is no need for the receiver to ask the sender to slow down. 
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on 
Gratton B= send (data) 
call from packet-make prt (data) 
‘above tdt_send (packet) 


a. rdt1.0: sending side 





Wait for rét_rev( packet) 


call from extract (packet, data) 
below deliver _data(data) 


b. rdt1.0: receiving side 


Reliable Data Transfer over a Channel with Bit Errors: rdt2.0 

A more realistic model of the underlying channel is one in which bits in a packet may be 

corrupted. 

Three additional protocol capabilities are required in (Automatic Repeat Request) ARQ 

protocols to handle the presence of bit errors: 

+ Error detection. First, a mechanism is needed to allow the receiver to detect when bit errors 
have occurred. 

+ Receiver feedback. Since the sender and receiver are typically executing on different end 
systems, possibly separated by thousands of miles, the only way for the sender to learn of the 
receiver's view of the world is for the receiver to provide explicit feedback to the sender. The 
positive (ACK) and negative (NAK) acknowledgment replies in the message-dictation 
scenario are examples of such feedback. 

+ Retransmission. A packet that is received in error at the receiver will be retransmitted by the 


sender. 
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sndpkt=make_pkt (data,checksum) 
udt_send(sndpkt ) 


Wait for Wait for rdt_rev(revpkt ) 6& isWAK( revpkt) 
call from ACK or 
above NAK 


No 


rdt_rov(revpkt) 6& isACK(revpkt) 


udt_send(sndpkt) 


ri 
a. rdt2.0: sending side 


rat_rev( revpkt) && corrupt (rcvpkt) 


sndpkt=make_pkt(NAK) 
udt_send(sndpkt) 





C Du. rev(rcvpkt) £6 notcorrupt(revpkt) 


extract(revpkt ,data) 
deliver_data(data) 
sndpkt-make_pkt (ACK) 
udt_send(sndpkt) 


b. rdt2.0: recelving side 


The sender side has two states. In the leftmost state, the send-side protocol is waiting for data 
to be passed down from the upper layer. 

When the rdt_send(data) event occurs, the sender will create a packet (sndpkt) containing the 
data to be sent, along with a packet checksum and then send the packet via the 
udt_send(sndpkt) operation. 

In the rightmost state, the sender protocol is waiting for an ACK or a NAK packet from the 
receiver. If an ACK packet is received (the notation rdt_rev(revpkt) && isACK (revpkt)), the 
sender knows that the most recently transmitted packet has been received correctly and thus 
the protocol returns to the state of waiting for data from the upper layer. 

If a NAK is received, the protocol retransmits the last packet and waits for an ACK or NAK 
to be returned by the receiver in response to the retransmitted data packet. 

When the sender is in the wait-for-ACK-or-NAK state, it cannot get more data from the 
upper layer; that is, the rdt_send() event can not occur; that will happen only after the sender 


receives an ACK and leaves this state. Thus, the sender will not send a new piece of data 





Archana R A, Asso. Prof., Dept. of CSE, SVIT 10 


Module — 2: Transport Layer 





until it is sure that the receiver has correctly received the current packet. Because of this 


behavior, protocols are known as stop-and-wait protocols. 





CK or NAK is corrupted (Duplicate Packet): 

+ In this case the sender resends the current data packet when it receives a garbled ACK or 
NAK packet. This approach, however, introduces duplicate packets into the sender-to- 
receiver channel. 

+ Assimple solution to this problem is to add a new field to the data packet and have the sender 
number its data packets by putting a sequence number into this field. The receiver then need 
only check this sequence number to determine whether or not the received packet is a 
retransmission. 

+ For stop-and wait protocol, a 1-bit sequence number will suffice, since it will allow the 


receiver to know whether the sender is resending the previously transmitted packet or a new 








packet. 
Sender: 
rdt_send(data) 
sndpkt=make_pkt (0, data,checksum) 
udt_send(sndpkt ) 
rdt_rev(revpkt )é& 
~s (corrupt (revpkt)| | 
re isWAK (revpkt )) 
Wait for Wait for udt_send(sndpkt) 
call 0 from ACK or 
above NAKO 
rdt_rev(revpkt) rdt_rev(revpkt) 
&& notcorrupt(rcvpkt) 66 notcorrupt(revpkt) 
66 isACK(rovpkt) 66 isACK( rovpkt) 
a 
Wait for Wait for 
ACK or call 1 from 
NAK1 above 
rdt_rov(revpkt) 66 
(corrupt (revpkt) | | 
isNAK(revpkt )) 
udt_send (sndpkt ) rdt_send(data) 





sndpkt--make_pkt (1,data,checksum) 
udt_send( sndpkt ) 





Archana R A, Asso. Prof., Dept. of CSE, SVIT 11 


Module — 2: Transport Layer 





Receiver: 


rdt_rev(revpkt) && notcorrupt (revpkt) 
&6 has_seq0(revpkt) 
extract(revpkt, data) 
deliver_data(data) 
sndpkt=make_pkt (ACK, checksum) 


aeeleer Gore) udt_send( sndpkt ) rdt_rev(revpkt) 6& corrupt (revpkt) 


66 corrupt (r* kt % Ft 
pt (revpkt) a - sndpkt=make_pkt(NAK, checksum) 
sndpkt-make pkt(NAK, checksum) udt_send( sndpkt) 


udt_send( sndpkt) 
Wait for Wait for 
0 from 1 pai 


rdt_rcv(revpkt)6& notcorrupt below 
(rowpkt) sthas_seql(revpkt) rdt_rev(revpkt )&& notcorrupt 
(rovpkt)s6has_seqO(revpkt) 


andpkt-make_pkt(ACK, checksum) sndpkt=make_pkt(ACK,checksum) 
udt_send( sndpkt) udt_send(sndpkt) 


rdt_rov(revpkt) && notcorrupt(revpkt) 
&& has_seql (rcvpkt) 





extract (revpkt, data) 
deliver_data(data) 
sndpkt=make_pkt (ACK, checksum) 
udt_send( sndpkt) 


Duplicate ACK 

+ When an out-of-order packet is received, the receiver sends a positive acknowledgment for 
the packet it has received. 

+ When a corrupted packet is received, the receiver sends a negative acknowledgment. We can 
accomplish the same effect as a NAK if, instead of sending a NAK, we send an ACK for the 
last correctly received packet. 

+ A sender that receives two ACKs for the same packet (that is, receives duplicate ACKs) 


knows that the receiver did not correctly receive the packet following the packet that is being 
ACKed twice. 


Sender: 
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rdt_send(data) 





sndpkt=make_pkt(0, data, checksum) 
udt_send(sndpkt) 


rét_rev(revpkt) 66 
(corrupt (revpkt )| | 






ore isACK(revpkt, 1) ) 
Wait for W udt_send(sndpkt ) 
AO trp AEG - 
above 
rdt_rev(revpkt) rdt_rev(revpkt) 

&& notcorrupt(revpkt ) && notcorrupt( revpkt) 
&& isACK(revpkt, 1) BE AsACK(rcvpkt, 0) 
A A 

‘Wait for 
pea ioe call 1 from 
Bes above 


rdt_rev(revpkt) && 
(corrupt (revpkt )| | 
isACK( revpkt ,0)) 


udt_send(sndpkt ) 





sndpkt=make_pkt(1,data, checksum) 
uudt_send( snapkt) 








Receiver: 
rdt_rev(revpkt) && notcorrupt(revpkt) 
56 has_seq0(revpkt) 
extract (revpkt, data) 
deliver_data(data) rdt_rev(revpkt) 66 
sndpkt=make_pkt (ACK, 0, checksum) (corrupt (revpkt) | | 
udt_send( sndpkt ) has_seq0(revpkt )) 
a — SS sndpkt=make_pkt (ACK,0,ch) 
‘ udt_send(sndpkt ) 
‘Wait for ‘Wait for 
rdt_rev(revpkt) && 0 from 1 from 
(corrupt(revpkt) || below below 


has_seql(revpkt) ) 


sndpkt=make_pkt(ACK,1,checksum) 
udt_send(sndpkt) 


rdt_rev(revpkt) && notcorrupt(revpkt) 
66 has_segl (revpkt) 





extract (revpkt, data) 
deliver _data(data) 
sndpkt=make_pkt (ACK, 1, checksum) 
udt_send( sndpkt ) 


Reliable Data Transfer over a Lossy Channel with Bit Errors: rdt3.0 
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e Suppose now that in addition to corrupting bits, the underlying channel can lose packets as 
well. 

¢ Two additional concerns must now be addressed by the protocol: how to detect packet loss 
and what to do when packet loss occurs. 

© Suppose that the sender transmits a data packet and either that packet, or the receiver’s ACK 
of that packet, gets lost. In either case, no reply is forthcoming at the sender from the 
receiver. If the sender is willing to wait long enough so that it is certain that a packet has 
been lost, it can simply retransmit the data packet. 

e But how long must the sender wait to be certain that something has been lost? The sender 
must clearly wait at least as long as a round-trip delay between the sender and receiver plus 
whatever amount of time is needed to process a packet at the receiver. 

e The approach thus adopted in practice is for the sender to judiciously choose a time value 
such that packet loss is likely, although not guaranteed, to have happened. If an ACK is not 


received within this time, the packet is retransmitted. 


rdt_send(data) 
sndpkt=make_pkt (0 data, checksum) 


udt_send(sndpkt ) rdt_rev(revpkt) && 
start_timer (corrupt (revpkt) | | 


isACK (revpkt,1)) 
ee 
rdt_rev(revpkt) 
A timeout 
Cait peattog udt_send(sndpkt) 
ACK O ee 


above start_timer 





rdt_rev(revpkt) 
&& notcorrupt( revpkt) 
&& isACK( revpkt, 1) rdt_rev(revpkt) 
&& notcorrupt (revpkt) 


stop_timer && isACK(rcvpkt, 0) 


stop_timer 


timeout Wait for 
udt_send(sndpkt) Wait for at 1 from 
start_timer ees 

rdt_rev(revpkt) 


rdt_rev( rcevpkt) && CU >) A 
(corrupt (revpkt )| | 
isACK(revpkt ,0) ) rdtsenddatay 


A sndpkt=make_pkt (1,data,checksum) 


udt_send(sndpkt ) 
start_timer 
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Following figures depicts various scenarios: 


Sender Receiver 
send pkt0 to 
rev pkt0 
pose send ACKO 
rev ACKO 
send pktl key 
rev pktl 
a send ACK1 
rev ACKL 
send pkt0 to 
rev pkto 
awe send ACKO 
a. Operation with no loss 
Sender Receiver 
send pkt0 oe 
rev pkto 
we send ACKO 
rev ACKO Pkey 
send pkt1 oats eee 
pot send ACKL 
(los) x 
timeout 
resend pkt1 t7 
rev pktl 
pod (detect 
a duplicate) 
rev ACK1 send ACK1 
send pkto - ~Att0 
rev pkto 
oy send ACKO 
c. Lost ACK 


Sender Receiver 
send pkt0 — 
rev pkto 
- send ACKO 
rev ACKO Pkey 
send pktl SS x (iss 
timeout 
resend pktl 7 


[ 


4! rev pktl 


4 
gle send ACKL 
rev ACK1 
send pkto : ~Skto 
rev pkto 
pod send ACKO 
b. Lost packet ro 
Sender Receiver 
send pkto ; Lkto 
rev pkto 
pot. = send acko 
rev ACKO a 
send al — 
rev pktl 
timeout | = _p¢, send ACK1 
resend pkti mc 
rev pkt 1 
rev ACK1 pOkLe | (detect duplicate) 
send pkto ol send ACKl 
rev ACKL rev pkto 
do nothing ws send ACKO 


d. Premature timeout 


2.4.2 Pipelined Reliable Data Transfer Protocols 


Let us consider two hosts located at two different locations. 


The speed-of-light round-trip propagation delay between these two end systems, RTT, is 


approximately 30 milliseconds. Suppose that they are connected by a channel with a 
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transmission rate, R, of 1 Gbps (10° bits per second). With a packet size, L, of 1,000 bytes (8,000 
bits) per packet, including both header fields and data, the time needed to 
actually transmit the packet into the 1 Gbps link is 


= 8 microseconds 





Data packet Data packets 





a. A stop-and-wait protocol in operation b. A pipelined protocol in operation 
Above Figure (a) shows that with our stop-and-wait protocol, if the sender begins sending the 
packet at t = 0, then at t= L/R = 8 microseconds, the last bit enters the channel at the sender side. 
The packet then makes its 15-msec cross-country journey, with the last bit of the packet 
emerging at the receiver at t= RTT/2 + L/R = 15.008 msec. 
Assuming for simplicity that ACK packets are extremely small (so that we can ignore their 
transmission time) and that the receiver can send an ACK as soon as the last bit of a data packet 
is received, the ACK emerges back at the sender at t= RTT + L/R = 30.008 msec. 
At this point, the sender can now transmit the next message. Thus, in 30.008 msec, the sender 
was sending for only 0.008 msec. If we define the utilization of the sender (or the channel) as the 
fraction of time the sender is actually busy sending bits into the channel, the analysis shows that 
the stop-and-wait protocol has a rather dismal sender utilization, Usender, of 


L/R 008 


=——= 2 
30.008 oe 


sender = 





That is, the sender was busy only 2.7 hundredths of one percent of the time! 


The solution to this particular performance problem is simple: Rather than operate in a stop-and- 
wait manner, the sender is allowed to send multiple packets without waiting for 


acknowledgments. if the sender is allowed to transmit three packets before having to wait 
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for acknowledgments. 

Since the many in-transit sender-to-receiver packets can be visualized as filling a pipeline, this 
technique is known as pipelining. Pipelining has the following consequences for reliable data 
transfer protocols: 

+ The range of sequence numbers must be increased, since each in-transit packet (not counting 
retransmissions) must have a unique sequence number and there may be multiple, in-transit, 
unacknowledged packets. 

+ The sender and receiver sides of the protocols may have to buffer more than one packet. 
Minimally, the sender will have to buffer packets that have been transmitted but not yet 
acknowledged. Buffering of correctly received packets may also be needed at the receiver, as 
discussed below. 

+ The range of sequence numbers needed and the buffering requirements will depend on the 
manner in which a data transfer protocol responds to lost, corrupted, and overly delayed packets. 
Two basic approaches toward pipelined error recovery can be identified: Go-Back-N and 


selective repeat. 


Sender Receiver 


First bit of first packet 
transmitted, t = 0 


Last bit of first packet es 
transmitted, t= UR 


'— First bit of first packet arrives 


RTT- — Last bit of first packet arrives, send ACK 


ACK arrives, send next inl 


t=RIT+UR 





a. Stop-and-walt operation 
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Sender Recelver 


First bit of first packet e oe 


transmitted, t = 0 
Last bit of first packet 












transmitted, t= U/R 


First bit of first packet arrives 

— Last bit of first packet arrives, send ACK 
— Last bit of 2nd packet arrives, send ACK 
— Last bit of 3rd packet arrives, send ACK 


ACK arrives, send next packet, 
t=RIT+UR 


b. Pipelined operation 


2.4.3 Go-Back-N (GBN) 

e In a Go-Back-N (GBN) protocol, the sender is allowed to transmit multiple packets (when 
available) without waiting for an acknowledgment, but is constrained to have no more than 
some maximum allowable number, N, of unacknowledged packets in the pipeline. 

e If base is the sequence number of the oldest unacknowledged packet and nextseqnum is the 
smallest unused sequence number (that is, the sequence number of the next packet to be 
sent). Sequence numbers in the interval [0,base-1] correspond to packets that have already 
been transmitted and acknowledged. 

e The interval [base,nextseqnum-1] corresponds to packets that have been sent but not yet 
acknowledged. 

e Sequence numbers in the interval [nextseqnum,base+N-1] can be used for packets that can be 


sent immediately, should data arrive from the upper layer. 


next seqnum 


Key: 


ono (os Des 


hereto [| Not usable 















































Hs 
Window size wes 
N 


e Sequence numbers greater than or equal to base+N cannot be used until an unacknowledged 


packet currently in the pipeline has been acknowledged. 
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e The range of permissible sequence numbers for transmitted but not yet acknowledged 
packets can be viewed as a window of size N over the range of sequence numbers. N is often 
referred to as the window size and the GBN protocol itself as a sliding-window protocol. 

e  Ifk is the number of bits in the packet sequence number field, the range of sequence numbers 
is thus [0,2k — 1]. With a finite range of sequence numbers, all arithmetic involving sequence 
numbers must then be done using modulo 2k arithmetic. 


rdt_send (data) 





if(nextsegnum<base+N ){ 
sndpkt[ nextseqnum }=make_pkt(nextseqnum, data, checksum) 
udt_send(sndpkt[nextseqnum]}) 
if(base=nextseqnum) 
start_timer 
A 











nextseqnum++ 
base=1 = } 
nextseqnuml 7 else 
~s refuse_data(data) 
ee: ( ) timeout 
‘ 





start_timer 
udt_send(sndpkt [base ]) 
udt_send(sndpkt [base+1]) 


Wait 
rdt_rev(revpkt ) && corrupt( revpkt) oe 
ee ( j udt_send(sndpkt (next seqnum-1} ) 


rdt_rev(revpkt) && notcorrupt(revpkt) 





base=getacknum( revpkt) +1 
IT £(base==next seqnum) 
stop_timer 
else 
start_timer 


GBN Sender 


rdt_rev(revpkt) 
&& notcorrupt( revpkt) 
66 hasseqnum(rcvpkt, expectedseqnum) 





extract(revpkt, data) 
deliver_data(data) 

sndpkt=make_pkt (expectedseqnum, ACK, checksum) 
udt_send(sndpkt ) 


expectedseqnum++ 
default 
aa SE > Wait SS 
A udt_send(sndpkt) 
expectedseqnum=1 


sndpkt=make_pkt (0, ACK, checksum) 


GBN Receiver 
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The GBN sender must respond to three types of events: 


Invocation from above. When rdt_send() is called from above, the sender first checks to see 
if the window is full, that is, whether there are N outstanding, unacknowledged packets. If 
the window is not full, a packet is created and sent, and variables are appropriately updated. 
If the window is full, the sender simply returns the data back to the upper layer, an implicit 
indication that the window is full. The upper layer would presumably then have to try again 
later. In a real implementation, the sender would more likely have either buffered (but not 
immediately sent) this data, or would have a synchronization mechanism that would allow 
the upper layer to call rdt_send() only when the window is not full. 

Receipt of an ACK. In our GBN protocol, an acknowledgment for a packet with sequence 
number n will be taken to be a cumulative acknowledgment, indicating that all packets with a 


sequence number up to and including n have been correctly received at the receiver. We'll 





come back to this issue shortly when we examine the receiver side of GBN. 

A timeout event. The protocol’s name, “Go-Back-N,” is derived from the sender’s behavior 
in the presence of lost or overly delayed packets. As in the stop-and-wait protocol, a timer 
will again be used to recover from lost data or acknowledgment packets. If a timeout occurs, 
the sender resends all packets that have been previously sent but that have not yet been 
acknowledged. If an ACK is received but there are still additional transmitted but not yet 


acknowledged packets, the timer is restarted. If there are no outstanding, unacknowledged 


packets, the timer is stopped. 


Receiver: 


If a packet with sequence number n is received correctly and is in order, the receiver sends an 
ACK for packet n and delivers the data portion of the packet to the upper layer. In all other 
cases, the receiver discards the packet and resends an ACK for the most recently received in- 
order packet. 

In our GBN protocol, the receiver discards out-of-order packets. GBN discard a correctly 
received but out-of-order packet. 

Suppose now that packet n is expected, but packet n + | arrives. Because data must be 
delivered in order, the receiver could buffer (save) packet n + | and then deliver this packet 


to the upper layer after it had later received and delivered packet n. However, if packet n is 
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lost, both it and packet n + 1 will eventually be retransmitted as a result of the GBN 


retransmission rule at the sender. Thus, the receiver can simply discard packet n + 1. 


2.4.4 Selective Repeat (SR) 





Limitation of GBN: GBN itself suffers from performance problems. In particular, when the 
window size and bandwidth-delay product are both large, many packets can be in the 
pipeline. A single packet error can thus cause GBN to retransmit a large number of packets. 
As the name suggests, selective-repeat protocols avoid unnecessary retransmissions by 
having the sender retransmit only those packets that it suspects were received in error (that is, 
were lost or corrupted) at the receiver. 

This individual, as needed, retransmission will require that the receiver individually 
acknowledge correctly received packets. 

A window size of N will again be used to limit the number of outstanding, unacknowledged 
packets in the pipeline. 

The SR receiver will acknowledge a correctly received packet whether or not it is in order. 
Out-of-order packets are buffered until any missing packets (that is, packets with lower 
sequence numbers) are received, at which point a batch of packets can be delivered in order 
to the upper layer. 


send base nextseqnum 
Key. 


ILOMEQNEOOEOOEECCOoO0 peer ae 


Sent, nat 

- Not usable 
window size yet Act's U 
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a. Sender view of sequence numbers 
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b. Receiver view of sequence numbers 
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1. Data received from above. When data is received from above, the SR sender 
checks the next available sequence number for the packet. If the sequence 
number is within the sender's window, the data is packetized and sent; other- 
wise it is either buffered or returned to the upper layer for later transmission, 
as in GBN. 

2. Timeout. Timers are again used to protect against lost packets. However. each 
packet must now have its own logical timer, since only a single packet will 
be transmitted on timeout. A single hardware timer can be used to mimic the 
operation of multiple logical timers [Varghese 1997]. 

3. ACK received. If an ACK is received, the SR sender marks that packet as 
having been received, provided it is in the window. If the packet’s sequence 
number is equal to send_base, the window base is moved forward to the 
unacknowledged packet with the smallest sequence number. If the window 
moves and there are untransmitted packets with sequence numbers that now 
fall within the window, these packets are transmitted. 











SR sender events and actions 


Sender Receiver 
pkt0 sent 
0123456789 pa 
pkt1 sent pkt0 revd, delivered, ACKO sent 
0123456789 0123456789 
[—pkt2 sent pktl revd, delivered, ACK1 sent 
0123456789 a? 0123456789 


(loss) 


pkt3 sent, window full 
0123456789 


pkt3 revd, buffered, ACK3 sent 
ACKO revd, pkt4 sent 0123456789 
0123456789 


ACK1 revd, pkt5 sent pkt4 revd, buffered, ACK4 sent 
0123456789 o123456789 


pkt5 revd; buffered, ACK5 sent 


L_pkt2 TIMEOUT, pkt2 0123456789 
resent 


0123456789 





pkt2 revd, pkt2,pkt3,pkt4, pkt5 
delivered, ACK2 sent 


0123456789 


ACK3 revd, nothing sent 
0123456789 


4 
< 
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1. Packet with sequence number in [rcv_base, rcev_base+N-1] is cor- 
rectly received. In this case, the received packet falls within the receiver's win- 
dow and a selective ACK packet is returned to the sender. If the packet was not 
previously received, it is buffered. If this packet has a sequence number equal to 
the base of the receive window (rcv_base in Figure 3.22), then this packet, 
and any previously buffered and consecutively numbered (beginning with 
rev_base) packets are delivered to the upper layer. The receive window is 
then moved forward by the number of packets delivered to the upper layer. As 
an example, consider Figure 3.26. When a packet with a sequence number of 
rcev_base=2 is received, it and packets 3, 4, and 5 can be delivered to the 
upper layer. 

. Packet with sequence number in [rcv_base-N, rev_base-1] is cor- 
rectly received. In this case, an ACK must be generated, even though this is a 
packet that the receiver has previously acknowledged. 

3. Otherwise. Ignore the packet. 


ie) 











SR receiver events and actions 


Summary of reliable data transfer mechanisms and their use 











Mechanism Use, Comments 
Checksum Used to detect bit errors in a transmitted pocket. 
Timer Used to timeout/retransmit a packet, possibly because the packet (or its ACK) wos 


lost within the channel. Becouse timeouts can occur when a packet is delayed but 
not lost (premature timeout), or when a packet has been received by the receiver 
but the receiver-to-sender ACK hes been lost, duplicate copies of « packet may be 
received by a receiver. 

‘Sequence number Used for sequential numbering of packets of data flowing from sender to receiver. 
Gops in the sequence numbers of received packets allow the receiver to detect a 
lost packet. Packets with duplicate sequence numbers allow the receiver to detect 
duplicate copies of a packet. 

‘Acknowledgment Used by the receiver to tell the sender that a packet or set of packets has been 
received correctly. Acknowledgments will typically carry the sequence number of the 
pocket or packets being acknowledged. Acknowledgments may be individual or 
cumulative, depending on the protocol. 

Negative acknowledgment Used by the receiver to tell the sender that 0 packet has not been received correct: 
ly. Negative acknowledgments will typically carry the sequence number of the pack- 
et that wos not received correctly. 

Window, pipelining The sender may be restricted to sending only packets with sequence numbers that 
fall within a given ronge. By allowing multiple packets to be transmitted but not yet 
acknowledged, sender utilization con be increased over a stop-and-woit mode of 
operation. We'll see shortly that the window size may be set on the basis of the 
receiver's ability to receive ond buffer messages, or the level of congestion in the 
network, or both. 
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2.5 Connection-Oriented Transport: TCP 
2.5.1 The TCP Connection 


TCP is said to be connection-oriented because connection has to be established between two 
application processes before they start transmitting data. 

As part of TCP connection establishment, both sides of the connection will initialize many 
TCP state variables associated with the TCP connection. 

A TCP connection provides a full-duplex service: If there is a TCP connection between 
Process A on one host and Process B on another host, then application layer data can flow 
from Process A to Process B at the same time as application layer data flows from Process B 
to Process A. 

A TCP connection is also always point-to-point, that is, between a single sender and a single 
receiver. Multicasting is not allowed. 

Once a TCP connection is established, the two application processes can send data to each 
other. 

Let’s consider the sending of data from the client process to the server process. The client 
process passes a stream of data through the socket 


Once the data passes through the door, the data is in the hands of TCP running in the client. 


ae Process rina.) 


writes data reads data 











Te [Segment — [Segment 








To 
send receive 
buffer buffer 




















TCP directs this data to the connection’s send buffer, which is one of the buffers that is set 
aside during the initial three-way handshake. 

From time to time, TCP will grab chunks of data from the send buffer and pass the data to the 
network layer. 

The maximum amount of data that can be grabbed and placed in a segment is limited by the 


maximum segment size (MSS). 
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e The MSS is typically set by first determining the length of the largest link-layer frame that 
can be sent by the local sending host (maximum transmission unit, MTU), and then setting 
the MSS to ensure that a TCP segment plus the TCP/IP header length will fit into a single 
link-layer frame. 

e TCP pairs each chunk of client data with a TCP header, thereby forming TCP segments. The 
segments are passed down to the network layer, where they are separately encapsulated 
within network-layer IP datagrams. 

e The IP datagrams are then sent into the network. 

¢ When TCP receives a segment at the other end, the segment’s data is placed in the TCP 
connection’s receive buffer. 

e The application reads the stream of data from this buffer. 


e Each side of the connection has its own send buffer and its own receive buffer. 


2.5.2 TCP Segment Structure 


32bits 
L 





Source port # Dest port # 
Sequence number 


‘Acknowledgment number 


Header 


zhez2 
length g 


Unused. 28 ESEz Receive window 


Internet checksum, Urgent data pointer 


Options 
Data 


e The TCP segment consists of header fields and a data field. 

e The data field contains a chunk of application data. 

¢ The minimum length of TCP header is 20 bytes. 

e The header includes source and destination port numbers, which are used for 
multiplexing/demultiplexing data from/to upper-layer applications. 


e The header includes a checksum field for error detection. 
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e A TCP segment header also contains the following fields: 


The 32-bit sequence number field and the 32-bit acknowledgment number field 
are used by the TCP sender and receiver in implementing a reliable data transfer 
service. 

The 16-bit receive window field is used for flow control. It is used to indicate the 
number of bytes that a receiver is willing to accept. 

The 4-bit header length field specifies the length of the TCP header in 32-bit words. 
The TCP header can be of variable length due to the TCP options field. 

The optional and variable-length options field is used when a sender and receiver 
negotiate the maximum segment size (MSS) or as a window scaling factor for use in 
high-speed networks. 

The flag field contains 6 bits. 

The ACK bit is used to indicate that the value carried in the acknowledgment field is 
valid; that is, the segment contains an acknowledgment for a segment that has been 
successfully received. 

The RST, SYN, and FIN bits are used for connection setup and teardown. 

Setting the PSH bit indicates that the receiver should pass the data to the upper layer 
immediately. 

Finally, the URG bit is used to indicate that there is data in this segment that the 
sending-side upper-layer entity has marked as “urgent.” 

The location of the last byte of this urgent data is indicated by the 16-bit urgent data 
pointer field. TCP must inform the receiving- side upper-layer entity when urgent 


data exists and pass it a pointer to the end of the urgent data. 


Sequence Numbers and Acknowledgment Numbers 


The sequence number for a segment is the byte-stream number of the first byte in the segment. 


Example: Suppose that a process in Host A wants to send a stream of data to a process in Host B 


over a TCP connection. The TCP in Host A will implicitly number each byte in the data stream. 


Suppose that the data stream consists of a file consisting of 500,000 bytes, that the MSS is 1,000 


bytes, and that the first byte of the data stream is numbered 0. TCP constructs 500 segments out 


of the data stream. The first segment gets assigned sequence number 0, the second segment gets 


assigned sequence number 1,000, the third segment gets assigned sequence number 2,000, and so 
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on. Each sequence number is inserted in the sequence number field in the header of the 


appropriate TCP segment. 


The acknowledgment number that Host A puts in its segment is the sequence number of the 
next byte Host A is expecting from Host B. 

Example: Suppose that Host A has received all bytes numbered 0 through 535 from B and 
suppose that it is about to send a segment to Host B. Host A is waiting for byte 536 and all the 
subsequent bytes in Host B’s data stream. So Host A puts 536 in the acknowledgment number 


field of the segment it sends to B. 


Telnet: A Case Study for Sequence and Acknowledgment Numbers 

Telnet is a popular application-layer protocol used for remote login. It runs over TCP and is 
designed to work between any pair of hosts. 

Suppose Host A initiates a Telnet session with Host B. Because Host A initiates the session, it is 
labeled the client, and Host B is labeled the server. Each character typed by the user (at the 
client) will be sent to the remote host; the remote host will send back a copy of each character, 
which will be displayed on the Telnet user’s screen. This “echo back” is used to ensure that 
characters seen by the Telnet user have already been received and processed at the remote site. 
Each character thus traverses the network twice between the time the user hits the key and the 


time the character is displayed on the user’s monitor. 
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Host A Host B 


User types 
vor 


Host ACKs 
receipt of 'c', 
echoes back ‘c* 





Host ACKs 
receipt of 
echoed 'c’ 


Time Time 


2.5.3 Estimating the Round-Trip Time 

The sample RTT, denoted SampleRTT, for a segment is the amount of time between when the 
segment is sent and when an acknowledgment for the segment is received. 

Instead of measuring a SampleRTT for every transmitted segment, most TCP implementations 
take only one SampleRTT measurement at a time. That is, at any point in time, the SampleRTT 
is being estimated for only one of the transmitted but currently unacknowledged segments, 
leading to a new value of SampleRTT approximately once every RTT. 

The SampleRTT values will fluctuate from segment to segment due to congestion in the routers 
and to the varying load on the end systems. 

In order to estimate a typical RTT, it is therefore natural to take some sort of average of the 
SampleRTT values. TCP maintains an average, called EstimatedRTT, of the SampleRTT values. 
EstimatedRTT = (1 — a) + EstimatedRTT + a - SampleRTT 

Here a = 0.125 


In statistics, This kind of average is called an exponential weighted moving average (EWMA). 
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In addition to having an estimate of the RTT, it is also valuable to have a measure of the 

variability of the RTT : DevRTT. 

DevRTT, is an estimate of how much SampleRTT typically deviates from EstimatedRTT: 
DevRTT = (1 — B) + DevRTT + B-| SampleRTT — EstimatedRTT | 


Here f = 0.25 
Now Timeout can be calculated as: 


TimeoutInterval = EstimatedRTT + 4 - DevRTT 


2.5.4 Reliable Data Transfer 


e TCP creates a reliable data transfer service on top of IP’s unreliable best effort service. 

e TCP’s reliable data transfer service ensures that the data stream that a process reads out of its 
TCP receive buffer is uncorrupted, without gaps, without duplication, and in sequence; that 
is, the byte stream is exactly the same byte stream that was sent by the end system on the 
other side of the connection. 


NextSeqNum=InitialSeqNumber 
SendBase=InitialSeqNumber 


loop (forever) { 
switch(event) 


event: data received from application above 
create TCP segment with sequence number NextSeqNum 
if (timer currently not running) 
start timer 
pass segment to IP 
Next SeqNum=NextSeqNum+length (data) 
break; 


event: timer timeout 
retransmit not-yet-acknowledged segment with 
smallest sequence number 
start timer 
break; 


event: ACK received, with ACK field value of y 
if (y > SendBase) { 
SendBase=y 
if (there are currently any not-yet-acknowledged segments) 
start timer 
} 


break; 


} /* end of loop forever */ 
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A Few Interesting Scenarios: 

1. Host A sends one segment to Host B. Suppose that this segment has sequence number 92 and 
contains 8 bytes of data. After sending this segment, Host A waits for a segment from B with 
acknowledgment number 100. Although the segment from A is received at B, the 
acknowledgment from B to A gets lost. In this case, the timeout event occurs, and Host A 
retransmits the same segment. Of course, when Host B receives the retransmission, it 
observes from the sequence number that the segment contains data that has already been 


received. Thus, TCP in Host B will discard the bytes in the retransmitted segment. 


Host A Host B 


Timeout 





2 Host A sends two segments back to back. The first segment has sequence number 92 and 8 
bytes of data, and the second segment has sequence number 100 and 20 bytes of data. 
Suppose that both segments arrive intact at B, and B sends two separate acknowledgments 
for each of these segments. The first of these acknowledgments has acknowledgment number 
100; the second has acknowledgment number 120. Suppose now that neither of the 
acknowledgments arrives at Host A before the timeout. When the timeout event occurs, Host 
A resends the first segment with sequence number 92 and restarts the timer. As long as the 
ACK for the second segment arrives before the new timeout, the second segment will not be 


retransmitted. 
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Host A Host B 






seq=92 timeout interval 


seq=92 timeout interval— 





Time Time 


3. Host A sends the two segments, exactly as in the second example. The acknowledgment of 
the first segment is lost in the network, but just before the timeout event, Host A receives an 
acknowledgment with acknowledgment number 120. Host A therefore knows that Host B has 
received everything up through byte 119; so Host A does not resend either of the two 


segments. 


Host B 





Seq=92 timeout interval 
eq: imeout intery pcret20 





Time Time 
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Doubling the Timeout Interval 

Each time TCP retransmits, it sets the next timeout interval to twice the previous value, rather 
than deriving it from the last EstimatedRTT and DevRTT. 

For example, suppose Timeout Interval associated with the oldest not yet acknowledged segment 
is 0.75 sec when the timer first expires. TCP will then retransmit this segment and set the new 
expiration time to 1.5 sec. If the timer expires again 1.5 sec later, TCP will again retransmit this 


segment, now setting the expiration time to 3.0 sec. 


Fast Retransmit 

One of the problems with timeout-triggered retransmissions is that the timeout period can be 
relatively long. When a segment is lost, this long timeout period forces the sender to delay 
resending the lost packet, thereby increasing the end-to end delay. Fortunately, the sender can 
often detect packet loss well before the timeout event occurs by noting so-called duplicate 
ACKs. A duplicate ACK is an ACK that reacknowledges a segment for which the sender has 


already received an earlier acknowledgment. 








Event TCP Receiver Action 
Anivol of in-order segment with expected sequence number. All Delayed ACK. Wait up to 500 msec for arrival of another in-order seg- 
dato up to expected sequence number already acknowledged. ment. If next in-order segment does not arrive in this interval, send on ACK. 





‘Anivol of in-order segment with expected sequence number. One Immediately send single cumulative ACK, ACKing both in-order segments. 
other in-order segment waifing for ACK transmission. 





Anivol of out-oForder segment with higher-than-expected sequence Immediately send duplicate ACK, indicating sequence number of next 





number. Gop detected. expected byte (which is the lower end of the gop). 
Anivol of segment that partially or completely fils in gap in Immediately send ACK, provided that segment starts at the lower end 
received dato. of gop. 





When a TCP receiver receives a segment with a sequence number that is larger than the next, 
expected, in-order sequence number, it detects a gap in the data stream—that is, a missing 
segment. This gap could be the result of lost or reordered segments within the network. 

Because a sender often sends a large number of segments back to back, if one segment is lost, 
there will likely be many back-to-back duplicate ACKs. If the TCP sender receives three 
duplicate ACKs for the same data, it takes this as an indication that the segment following the 


segment that has been ACKed three times has been lost. In the case that three duplicate ACKs 
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are received, the TCP sender performs a fast retransmit, retransmitting the missing segment 


before that segment’s timer expires. 


event: ACK received, with ACK field value of y 
if (y > SendBase) { 
SendBase=y 
if (there are currently any not yet 
acknowledged segments) 
start timer 
} 
else { /* a duplicate ACK for already ACKed 
segment */ 
increment number of duplicate ACKs 
received for y 
if (number of duplicate ACKS received 
for y==3) 
/* TCP fast retransmit +/ 
resend segment with sequence number y 


break; 


2.5.5 Flow Control 


e TCP provides a flow-control service to its applications to eliminate the possibility of the 
sender overflowing the receiver’s buffer. 
e Flow control is a speed-matching service—matching the rate at which the sender is sending 
against the rate at which the receiving application is reading. 
e TCP provides flow control by having the sender maintain a variable called the receive 
window. 
e Informally, the receive window is used to give the sender an idea of how much free buffer 
space is available at the receiver. 
e Suppose that Host A is sending a large file to Host B over a TCP connection. Host B 
allocates a receive buffer to this connection; denote its size by RevBuffer. 
e From time to time, the application process in Host B reads from the buffer. Define the 
following variables: 
= LastByteRead: the number of the last byte in the data stream read from the buffer by 
the application process in B 
= LastByteRcvd: the number of the last byte in the data stream that has arrived from the 


network and has been placed in the receive buffer at B 
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rwnd 


+> 





Data Application 
from IP process 
ee! prec 


Because TCP is not permitted to overflow the allocated buffer, we must have 
LastByteRcvd — LastByteRead RevBuffer 
The receive window, denoted rwnd is set to the amount of spare room in the buffer: 


rwnd = RevBuffer — [LastByteRcevd — LastByteRead] 


Host B tells Host A how much spare room it has in the connection buffer by placing its current 
value of rwnd in the receive window field of every segment it sends to A. Initially, Host B sets 


rwnd = RevBuffer. 


Host A in turn keeps track of two variables, LastByteSent and Last- ByteAcked. The difference 
between these two variables, LastByteSent — LastByteAcked, is the amount of unacknowledged 
data that A has sent into the connection. By keeping the amount of unacknowledged data less 
than the value of rwnd, Host A is assured that it is not overflowing the receive buffer at Host B. 
Thus, Host A makes sure throughout the connection’s life that 


LastByteSent — LastByteAcked rwnd 


2.5.6 TCP Connection Management 


TCP has 3 phases 
1) Connection Establishment phase 
2) Data transmission phase 


3) Connection Termination phase 
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Connection establishment phase: 

Step 1: The client-side TCP first sends a special TCP segment to the server-side TCP. This 
special segment contains no application-layer data. But one of the flag bits in the segment’s 
header, the SYN bit, is set to 1. For this reason, this special segment is referred to as a SYN 
segment. In addition, the client randomly chooses an initial sequence number (client_isn) and 
puts this number in the sequence number field of the initial TCP SYN segment. 

Step 2: Once the IP datagram containing the TCP SYN segment arrives at the server host the 
server extracts the TCP SYN segment from the datagram, allocates the TCP buffers and 
variables to the connection, and sends a connection-granted segment to the client TCP. This 
connection-granted segment also contains no application layer data. However, it does contain 
three important pieces of information in the segment header. First, the SYN bit is set to 1. 
Second, the acknowledgment field of the TCP segment header is set to client_isn+1. Finally, the 
server chooses its own initial sequence number (server_isn) and puts this value in the sequence 
number field of the TCP segment header. This is referred as SYNACK segment. 

Step 3: Upon receiving the SYNACK segment, the client also allocates buffers and variables to 
the connection. The client host then sends the server yet another segment; this last segment 
acknowledges the server’s connection-granted segment. The SYN bit is set to zero, since the 
connection is established. This third stage of the three-way handshake may carry client-to-server 
data in the segment payload. 


lent host Server host 


li 








Connection 
request —— 
— Connection 
granted 
ACK 
Time Time 
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Connection Termination phase: 

Either of the two processes participating in a TCP connection can end the connection. When a 
connection ends, the “resources” (that is, the buffers and variables) in the hosts are deallocated. 
For connection termination TCP sends segment with FIN flag set to 1. When the server receives 
this segment, it sends the client an acknowledgment segment in return. The server then sends its 
own shutdown segment, which has the FIN bit set to 1. Finally, the client acknowledges the 


server’s shutdown segment. At this point, all the resources in the two hosts are now deallocated. 





State transition diagram: 
Client 
Client application 
initiates a TCP connection 
cLoseD 
‘Wait 30 seconds, 
Send SYN 
‘TIME_WAIT Ba Gee 
Receive FIN, Receive SYN & ACK, 
send ACK send ACK 
pe ESTABLISHED 
‘Send FIN 
Receive ACK,” 
see FIN_NATE_1 “Client application 
Initiates close connection 
Server: 
Server application 
creates a listen socket 
Receive ACK, CLOSED 
send nothing 
LAST_ACK ~ees 
Receive SYN 
Seaton send SYN & ACK 
(CLOSE_WAIT s¥w_RevD 
Receive FIN, Receive ACK, 
send ACK send nothing 
ESTABLISHED 
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2.6 Principles of Congestion Control 


2.61The Causes and the Costs of Congestion 


Scenario 1: Two Senders, a Router with Infinite Buffers 


din: Original data out 







Host B 


> 


Unlimited shared 
output link buffers 


Two hosts (A and B) each have a connection that shares a single hop between source and 
destination. 

Let’s assume that the application in Host A is sending data into the connection at an average 
rate of Ain bytes/sec. 

These data are original in the sense that each unit of data is sent into the socket only once. 
The underlying transport-level protocol is a simple one. 

Data is encapsulated and sent; no error recovery (for example, retransmission), flow control, 
or congestion control is performed. 

Ignoring the additional overhead due to adding transport- and lower-layer header 
information, the rate at which Host A offers traffic to the router in this first scenario is thus 
hin bytes/sec. 

Host B operates in a similar manner, and we assume for simplicity that it too is sending at a 
rate of Ain bytes/sec. 

Packets from Hosts A and B pass through a router and over a shared outgoing link of 
capacity R. The router has buffers that allow it to store incoming packets when the packet- 
arrival rate exceeds the outgoing link’s capacity. In this first scenario, we assume that the 


router has an infinite amount of buffer space. 





Archana R A, Asso. Prof., Dept. of CSE, SVIT 37 


Module — 2: Transport Layer 





e Below figure plots the performance of Host A’s connection under this first scenario. The left 
graph plots the per-connection throughput (number of bytes per second at the receiver) as a 


function of the connection-sending rate. 
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Scenario 2: Two Senders and a Router with Finite Buffers 
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e First, the amount of router buffering is assumed to be finite. A consequence of this real-world 
assumption is that packets will be dropped when arriving to an already full buffer. 

e Second, we assume that each connection is reliable. If a packet containing a transport-level 
segment is dropped at the router, the sender will eventually retransmit it. 

e Because packets can be retransmitted, we must now be more careful with our use of the term 


sending rate. 
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e Specifically, let us again denote the rate at which the application sends original data into the 


socket by Xin in bytes/sec. The rate at which the transport layer sends segments (containing 


original data and retransmitted data) into the network will be denoted ia in bytes/sec. ia in 


is sometimes referred to as the offered load to the network. 


Ri2: 


RI2- 


R34 
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a b. 
e We see here another cost of a congested network— the sender must perform retransmissions 
in order to compensate for dropped (lost) packets due to buffer overflow. 
e Here then is yet another cost of a congested network—unneeded retransmissions by the 
sender in the face of large delays may cause a router to use its link bandwidth to forward 


unneeded copies of a packet. 


Scenario 3: Four Senders, Routers with Finite Buffers, and Multihop Paths 

e We again assume that each host uses a timeout/retransmission mechanism to implement a 
reliable data transfer service, that all hosts have the same value of Nin in, and that all router 
links have capacity R bytes/sec. 

e Here we see yet another cost of dropping a packet due to congestion— when a packet is 
dropped along a path, the transmission capacity that was used at each of the upstream links to 


forward that packet to the point at which it is dropped ends up having been wasted. 
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2.6.2 Approaches to Congestion Control 

e End-to-end congestion control. In an end-to-end approach to congestion control, the 
network layer provides no explicit support to the transport layer for congestion control 
purposes. Even the presence of congestion in the network must be inferred by the end 
systems based only on observed network behavior 

e Network-assisted congestion control. With network-assisted congestion control, network- 
layer components (that is, routers) provide explicit feedback to the sender regarding the 
congestion state in the network. This feedback may be as simple as a single bit indicating 
congestion at a link. 

For network-assisted congestion control, congestion information is typically fed back from the 

network to the sender in one of two ways: 

Direct feedback may be sent from a network router to the sender. This form of notification 


typically takes the form of a choke packet. 
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The second form of notification occurs when a router marks/updates a field in a packet flowing 


from sender to receiver to indicate congestion. Upon receipt of a marked packet, the receiver 


then notifies the sender of the congestion indication. 
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2.6.3 Network-Assisted Congestion-Control Example: ATM ABR Congestion 
Control 


Fundamentally ATM takes a virtual-circuit (VC) oriented approach toward packet switching. 
ABR has been designed as an elastic data transfer service in a manner reminiscent of TCP. 
When the network is underloaded, ABR service should be able to take advantage of the spare 
available bandwidth; when the network is congested, ABR service should throttle its 


transmission rate to some predetermined minimum transmission rate. 
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¢ With ATM ABR service, data cells are transmitted from a source to a destination through a 
series of intermediate switches. 
e Interspersed with the data cells are resource-management cells (RM cells); these RM cells 
can be used to convey congestion-related information among the hosts and switches. 
e ABR provides three mechanisms for signaling congestion-related information from the 
switches to the receiver: 
= EFCI bit. Each data cell contains an explicit forward congestion indication (EFCI) bit. A 
congested network switch can set the EFCI bit in a data cell to 1 to signal congestion to 
the destination host. The destination must check the EFCI bit in all received data cells. 
When an RM cell arrives at the destination, if the most recently received data cell had the 
EFCI bit set to 1, then the destination sets the congestion indication bit (the CI bit) of the 
RM cell to 1 and sends the RM cell back to the sender. Using the EFCI in data cells and 
the CI bit in RM cells, a sender can thus be notified about congestion at a network switch. 
= Cl and NI bits. As noted above, sender-to-receiver RM cells are interspersed with data 
cells. The rate of RM cell interspersion is a tunable parameter, with the default value 
being one RM cell every 32 data cells. These RM cells have a congestion indication (CI) 
bit and a no increase (NI) bit that can be set by a congested network switch. Specifically, 
a switch can set the NI bit in a passing RM cell to 1 under mild congestion and can set the 
CI bit to 1 under severe congestion conditions. When a destination host receives an RM 
cell, it will send the RM cell back to the sender with its CI and NI bits intact 
= ER setting. Each RM cell also contains a 2-byte explicit rate (ER) field. A congested 
switch may lower the value contained in the ER field in a passing RM cell. In this 
manner, the ER field will be set to the minimum supportable rate of all switches on the 


source-to-destination path. 


2.7 TCP Congestion Control 
e The TCP congestion-control mechanism operating at the sender keeps track of an additional 
variable, the congestion window. The congestion window, denoted cwnd, imposes a 


constraint on the rate at which a TCP sender can send traffic into the network. Specifically, 
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the amount of unacknowledged data at a sender may not exceed the minimum of cwnd and 


rwnd, that is: 


LastByteSent — LastByteAcked = min{cwnd, rwnd} 


The sender’s send rate is roughly cwnd/RTT bytes/sec. By adjusting the value of cwnd, the 

sender can therefore adjust the rate at which it sends data into its connection. 

Congestion window can be adjusted with following principles: 

= A lost segment implies congestion, and hence, the TCP sender’s rate should be decreased 
when a segment is lost. 

= An acknowledged segment indicates that the network is delivering the sender’s segments 
to the receiver, and hence, the sender’s rate can be increased when an ACK arrives for a 
previously unacknowledged segment. 

= Bandwidth probing. Given ACKs indicating a congestion-free source-to-destination path 
and loss events indicating a congested path, TCP’s strategy for adjusting its transmission 
rate is to increase its rate in response to arriving ACKs until a loss event occurs, at which 
point, the transmission rate is decreased. The TCP sender thus increases its transmission 
rate to probe for the rate that at which congestion onset begins, backs off from that rate, 


and then to begins probing again to see if the congestion onset rate has changed. 


TCP Congestion control mechanisms: 


1) Slow Start 


When a TCP connection begins, the value of cwnd is typically initialized to a small value of 
1 MSS, resulting in an initial sending rate of roughly MSS/RTT. 

In the slow-start state, the value of cwnd begins at 1 MSS and increases by | MSS every time 
a transmitted segment is first acknowledged. 

In the below figure TCP sends the first segment into the network and waits for an 
acknowledgment. When this acknowledgment arrives, the TCP sender increases the 
congestion window by one MSS and sends out two maximum-sized segments. These 
segments are then acknowledged, with the sender increasing the congestion window by | 
MSS for each of the acknowledged segments, giving a congestion window of 4 MSS, and so 
on. This process results in a doubling of the sending rate every RTT. Thus, the TCP send rate 


starts slow but grows exponentially during the slow start phase. 
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Time Time 


e If there is a loss event (i.e., congestion) indicated by a timeout, the TCP sender sets the value 
of cwnd to | and begins the slow start process anew. 

e It also sets the value of a second state variable, ssthresh to cwnd/2—half of the value of the 
congestion window value when congestion was detected. 

e The second way in which slow start may end is directly tied to the value of ssthresh. Since 
ssthresh is half the value of cwnd when congestion was last detected, it might be a bit 
reckless to keep doubling cwnd when it reaches or surpasses the value of ssthresh. Thus, 
when the value of cwnd equals ssthresh, slow start ends and TCP transitions into congestion 
avoidance mode. 

e If three duplicate ACKs are detected, in which case TCP performs a fast retransmit and 


enters the fast recovery state. 
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dupACKcount++ dupACKcount=0 dupaAcKeount=0 
transmit new segment(s), as allowed transmit new segment(s), as allowed 
ssthresh=64 KB cwnd2ssthresh 
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S Sow —— NS Congestion 
stat avoidance 
timeout timeout 
———— ————— duplicate ACK 
ssthresh=cwnd/2 ssthresh=cwnd/2 ———— 
cwnd=1 MSS cwnd=1 Mss dupACKcount ++ 
dupACcKcount=0 dupACKcount=0 


retransmit missing segment retransmit missing segment 









timeout 


TH ye ACK 
ssthresh=cwnd/2 pew AC 


cwnd=1 cynd=ssthresh 
dupacKcount=0 dupAcKcount=0 
retransmit missing segment 





dupacKcount==3 
ssthresh=cwnd/2 
cwnd=ssthresht3-MSS 
retransmit missing segment 


dupacKcount: 
ssthresh=cwnd/2 
cwnd-ssthresh+3.mss 
retransmit missing segment 





cwnd=cwnd+mss 
transmit new segment(s), as allowed 


2) Congestion Avoidance 

¢ On entry to the congestion-avoidance state, the value of cwnd is approximately half its value 
when congestion was last encountered—congestion could be just around the corner! Thus, 
rather than doubling the value of cwnd every RTT, TCP adopts a more conservative 
approach and increases the value of cwnd by just a single MSS every RTT. 

© TCP’s congestion-avoidance algorithm behaves the same when a timeout occurs. As in the 
case of slow start: The value of cwnd is set to 1 MSS, and the value of ssthresh is updated to 


half the value of cwnd when the loss event occurred. 


3) Fast Recovery 
e In fast recovery, the value of cwnd is increased by 1 MSS for every duplicate ACK received 


for the missing segment that caused TCP to enter the fast-recovery state. 
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e Eventually, when an ACK arrives for the missing segment, TCP enters the congestion- 
avoidance state after deflating cwnd. 

e Ifa timeout event occurs, fast recovery transitions to the slow-start state after performing the 
same actions as in slow start and congestion avoidance: The value of cwnd is set to 1 MSS, 


and the value of ssthresh is set to half the value of cwnd when the loss event occurred. 


Macroscopic Description of TCP Throughput: 


cai: f ; 0.75 - W 
average t of a connection = ———— 
average throughput of a connection RIT 
Where W is window size 
TCP Over High-Bandwidth Paths: 
1,22 . MSS 
average throughput of a connection = ————— 
RIT VL 


Lis loss rate L 
RTT is the round-trip time 


MSS is maximum segment size 
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Module — 1 
APPLICATION LAYER 


1.1 Principles of Network Applications 

Network application development is writing programs that run on different end systems and 
communicate with each other over the network. 

For example, in the Web application there are two distinct programs that communicate with each 
other: the browser program running in the user’s host and the Web server program running in the 


Web server host. 


1.1.1 Network Application Architectures. 
There are two different network application architecture, they are 
1) Client Server Architecture 


2) P2P Architecture 


Client Server Architecture: 

e In client-server architecture, there is an always-on host, called the server, which provides 
services when it receives requests from many other hosts, called clients. 

Example: In Web application Web server services requests from browsers running on client 
hosts. When a Web server receives a request for an object from a client host, it responds by 
sending the requested object to the client host. 

e Inclient-server architecture, clients do not directly communicate with each other. 

e The server has a fixed, well-known address, called an IP address. Because the server has a 
fixed, well-known address, and because the server is always on, a client can always contact 
the server by sending a packet to the server’s IP address. 

e Some of the better-known applications with a client-server architecture include the Web, 


FTP, Telnet, and e-mail. 
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Client Server Architecture 
In a client-server application, a single-server host is incapable of keeping up with all the 
requests from clients. For this reason, a data center, housing a large number of hosts, is often 
used to create a powerful virtual server. 
The most popular Internet services—such as search engines (e.g., Google and Bing), Internet 
commerce (e.g., Amazon and e-Bay), Web-based email (e.g., Gmail and Yahoo Mail), social 


networking (e.g., Facebook and Twitter)— employ one or more data centers. 


Peer-to-peer (P2P) Architecture: 


In a P2P architecture, there is minimal dependence on dedicated servers in data centers. 
The application employs direct communication between pairs of intermittently connected 
hosts, called peers. 
The peers are not owned by the service provider, but are instead desktops and laptops 
controlled by users, with most of the peers residing in homes, universities, and offices. 
Many of today’s most popular and traffic-intensive applications are based on P2P 
architectures. These applications include file sharing (e.g., BitTorrent), Internet Telephony 
(e.g., Skype), and IPTV (e.g., Kankan and PPstream). 
Features: 
= Self-scalability: 
For example, in a P2P file-sharing application, although each peer generates workload by 
requesting files, each peer also adds service capacity to the system by distributing files to 
other peers. 
= Cost effective: 
P2P architectures are also cost effective, since they normally don’t require significant 


server infrastructure and server bandwidth 
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P2P Architecture 


Future P2P applications face three major challenges: 


1 


ISP Friendly. Most residential ISPs have been dimensioned for “asymmetrical” bandwidth 
usage, that is, for much more downstream than upstream traffic. But P2P video streaming 
and file distribution applications shift upstream traffic from servers to residential ISPs, 
thereby putting significant stress on the ISPs. Future P2P applications need to be designed so 
that they are friendly to ISPs 

Security. Because of their highly distributed and open nature, P2P applications can be a 
challenge to secure 

Incentives. The success of future P2P applications also depends on convincing users to 
volunteer bandwidth, storage, and computation resources to the applications, which is the 


challenge of incentive design. 


1.1.2 Processes Communicating 


A Process is a program or application under execution. 

When processes are running on the same or different end system, they can communicate with 
each other with inter process communication, using rules that are governed by the end 
system’s operating system. 

Processes on two different end systems communicate with each other by exchanging 
messages across the computer network. A sending process creates and sends messages into 
the network; a receiving process receives these messages and possibly responds by sending 


messages back. 
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Client and Server Processes 


A network application consists of pairs of processes that send messages to each other over a 
network. 

For example, in the Web application a client browser process exchanges messages with a 
Web server process. 

In the context of a communication session between a pair of processes, the process that 
initiates the communication is labeled as the client. The process that waits to be contacted to 


begin the session is the server. 


The Interface between the Process and the Computer Network 


A process sends messages into, and receives messages from, the network through a software 
interface called a socket. 

It is also referred to as the Application Programming Interface (API) between the application 
and the network, since the socket is the programming interface with which network 
applications are built. 

The application at the sending side pushes messages through the socket. At the other side of 
the socket, the transport-layer protocol has the responsibility of getting the messages to the 


socket of the receiving process. 


Host or 
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Addressing Processes 


For a process running on one host to send packets to a process running on another host, the 
receiving process needs to have an address. 
To identify the receiving process, two pieces of information need to be specified: 

(1) The address of the host 

(2) An identifier that specifies the receiving process in the destination host. 
In the Internet, the host is identified by its IP address. 
In addition to knowing the address of the host to which a message is destined, the sending 
process must also identify the receiving process running in the host. A destination port 
number serves this purpose. Popular applications have been assigned specific port numbers. 
For example, a Web server is identified by port number 80. A mail server process (using the 


SMTP protocol) is identified by port number 25. 


1.1.3 Transport Services Available to Applications 
1) Reliable Data Transfer 


Packets can get lost within a computer network. For example, a packet can overflow a buffer 
in a router, or can be discarded by a host or router after having some of its bits corrupted. 

For many applications—such as electronic mail, file transfer, remote host access, Web 
document transfers, and financial applications—data loss can have devastating consequences. 
Thus, to support these applications, something has to be done to guarantee that the data sent 
by one end of the application is delivered correctly and completely to the other end of the 
application. 


If a protocol provides such a guaranteed data delivery service, it is 





said to provide reliable 
data transfer. One important service that a transport-layer protocol can potentially provide to 
an application is process-to-process reliable data transfer. 

When a transport protocol provides this service, the sending process can just pass its data into 
the socket and know with complete confidence that the data will arrive without errors at the 
receiving process. 

When a transport-layer protocol doesn’t provide reliable data transfer, some of the data sent 


by the sending process may never arrive at the receiving process. This may be acceptable for 
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loss-tolerant applications, most notably multimedia applications such as conversational 


audio/video that can tolerate some amount of data loss. 


2) Throughput 


Transport-layer protocol could provide guaranteed available throughput at some specified 
rate. 

With such a service, the application could request a guaranteed throughput of r bits/sec, and 
the transport protocol would then ensure that the available throughput is always at least r 
bits/sec. Such a guaranteed throughput service would appeal to many applications. 

For example, if an Internet telephony application encodes voice at 32 kbps, it needs to send 
data into the network and have data delivered to the receiving application at this rate. 

Tf the transport protocol cannot provide this throughput, the application would need to encode 
at a lower rate or may have to give up. 

Applications that have throughput requirements are said to be bandwidth-sensitive 
applications. Many current multimedia applications are bandwidth sensitive 

Elastic applications can make use of as much, or as little, throughput as happens to be 


available. Electronic mail, file transfer, and Web transfers are all elastic applications. 


3) Timing 


A transport-layer protocol can also provide timing guarantees. 
Interactive real-time applications, such as Internet telephony, virtual environments, 
teleconferencing, and multiplayer games require tight timing constraints on data delivery in 


order to be effective. 


4) Security 


Transport protocol can provide an application with one or more security services. 

For example, in the sending host, a transport protocol can encrypt all data transmitted by the 
sending process, and in the receiving host, the transport-layer protocol can decrypt the data 
before delivering the data to the receiving process. 

A transport protocol can provide security services like confidentiality, data integrity and end- 


point authentication. 
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1.1.4 Transport Services Provided by the Internet 


The Internet makes two transport protocols available to applications, UDP and TCP. 


























Application Data Loss Throughput Time Sensitive 

File tronsfer/download No loss Elastic No 

Email No loss Elastic No 

Web documents No loss Elastic (few kbps) No 

Internet telephony/ Loss-tolerant Audio: few kbps—1 Mbps Yes: 100s of msec 
Video conferencing Video: 10 kbps—5 Mbps 

Streaming stored Loss-tolerant Same as above Yes: few seconds 
oudio/video 

Interoctive games Loss-toleront Few kbps—10 kbps Yes: 100s of msec 
Instant messaging No loss Elostic Yes ond no 





Requirements of selected network applications 


TCP Services 

The TCP service model includes a connection-oriented service and a reliable data transfer 

service. 

1) Connection-oriented service: 

e¢ In TCP the client and server exchange transport layer control information with each other 
before the application-level messages begin to flow. 

e This handshaking procedure alerts the client and server, allowing them to prepare for an 
onslaught of packets. 

e After the handshaking phase, a TCP connection is said to exist between the sockets of the 
two processes. 

e The connection is a full-duplex connection in that the two processes can send messages to 
each other over the connection at the same time. 


e¢ When the application finishes sending messages, it must tear down the connection. 


2) Reliable data transfer service: 
e The communicating processes can rely on TCP to deliver all data sent without error and in 


the proper order. 
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e When one side of the application passes a stream of bytes into a socket, it can count on TCP 
to deliver the same stream of bytes to the receiving socket, with no missing or duplicate 
bytes. 


TCP also includes a congestion-contro] mechanism. 


UDP Services 

« UDP is connectionless, so there is no handshaking before the two processes start to 
communicate. 

e UDP provides an unreliable data transfer service—that is, when a process sends a message 
into a UDP socket, UDP provides no guarantee that the message will ever reach the receiving 
process. 

e UDP does not include a congestion-control mechanism, so the sending side of UDP can 


pump data into the layer below (the network layer) at any rate it pleases. 























Application Application-Layer Protocol Underlying Transport Protocol 
Electronic mail SMTP [RFC 5321] Te 

Remote terminal access Telnet [RFC 854] Te? 

Web HTTP [RFC 2616] Te 

File transfer FIP [RFC 959] TP 

Streaming multimedia HTTP (e.g., YouTube) Tee 

Intemet telephony SIP [RFC 3261}, RIP [RFC 3550], or proprietary UDP or TCP 

(e.g., Skype) 





Popular Internet applications, their application-layer protocols, and their underlying transport 


protocols 


1.1.5 Application-Layer Protocols 
An application-layer protocol defines: 
e The types of messages exchanged, for example, request messages and response messages 


e The syntax of the various message types, such as the fields in the message and how the fields 


are delineated 
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The semantics of the fields, that is, the meaning of the information in the fields 


Rules for determining when and how a process sends messages and responds to messages. 


1.2 The Web and HTTP 
1.2.1 Overview of HTTP 


The Hyper Text Transfer Protocol (HTTP), the Web’s application-layer protocol, is at the 
heart of the Web. 

HTTP is implemented in two programs: a client program and a server program. The client 
program and server program, executing on different end systems, talk to each other by 
exchanging HTTP messages. HTTP defines the structure of these messages and how the 
client and server exchange the messages. 

A Web page consists of objects. An object is simply a file like HTML file, a JPEG image, a 
Java applet, or a video clip—that is addressable by a single URL. 

Most Web pages consist of a base HTML file and several referenced objects. For example, if 
a Web page contains HTML text and five JPEG images, then the Web page has six objects: 
the base HTML file plus the five images. 

The base HTML file references the other objects in the page with the objects’ URLs. Each 
URL has two components: the hostname of the server that houses the object and the object’s 
path name. 

For example, the URL http://www.saividya.ac.in/home/picture.gif has www.saividya.ac.in 
for a hostname and /home/picture.gif for a path name. 

HTTP defines how Web clients request Web pages from Web servers and how servers 
transfer Web pages to clients. 

When a user requests a Web page (for example, clicks on a hyperlink), the browser sends 
HTTP request messages for the objects in the page to the server. The server receives the 
requests and responds with HTTP response messages that contain the objects. 

HTTP uses TCP as its underlying transport protocol. The HTTP client first initiates a TCP 
connection with the server. Once the connection is established, the browser and the server 


processes access TCP through their socket interfaces. 
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e It is important to note that the server sends requested files to clients without storing any state 
information about the client. If a particular client asks for the same object twice in a period of 
a few seconds, the server does not respond by saying that it just served the object to the 
client; instead, the server resends the object, as it has completely forgotten what it did earlier. 
Because an HTTP server maintains no information about the clients, HTTP is said to be a 


stateless protocol. 


Server running 
Apache Web server 





PC running Linux running 
Internet Explorer Firefox 


1.2.2 Non-Persistent and Persistent Connections 

If Separate TCP connection is used for each request and response, then the connection is said to 

be non persistent. If same TCP connection is used for series of related request and response, then 

the connection is said to be persistent. 

HTTP with Non-Persistent Connections 

Let’s suppose the page consists of a base HTML file and 10 JPEG images, and that all 11 of 

these objects reside on the same server. 

Further suppose the URL for the base HTML file is 

http://www.someSchool.edu/someDepartment/home.index 

Here is what happens: 

1. The HTTP client process initiates a TCP connection to the server www.someSchool.edu on 
port number 80, which is the default port number for HTTP. Associated with the TCP 


connection, there will be a socket at the client and a socket at the server. 
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The HTTP client sends an HTTP request message to the server via its socket. The request 
message includes the path name /someDepartment/home.index. 

The HTTP server process receives the request message via its socket, retrieves the object 
/someDepartment/home.index from its storage (RAM or disk), encapsulates the object in an 
HTTP response message, and sends the response message to the client via its socket. 

The HTTP server process tells TCP to close the TCP connection. 

The HTTP client receives the response message. The TCP connection terminates. The 
message indicates that the encapsulated object is an HTML file. The client extracts the file 
from the response message, examines the HTML file, and finds references to the 10 JPEG 
objects. 


The first four steps are then repeated for each of the referenced JPEG objects. 








“}rime to transmit file 


Time Time 
at dient at server 


Round-trip time (RTT) is the time it takes for a small packet to travel from client to server 
and then back to the client. 

The RTT includes packet-propagation delays, packet queuing delays in intermediate routers 
and switches, and packet-processing delays. 

When a user clicks on a hyperlink, the browser initiate a TCP connection between the 
browser and the Web server; this involves a “three-way handshake”—the client sends a small 
TCP segment to the server, the server acknowledges and responds with a small TCP segment, 


and, finally, the client acknowledges back to the server. 
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The first two parts of the three way handshake take one RTT. 

After completing the first two parts of the handshake, the client sends the HTTP request 
message combined with the third part of the three-way handshake (the acknowledgment) into 
the TCP connection. 

Once the request message arrives at the server, the server sends the HTML file into the TCP 
connection. This HTTP request/response eats up another RTT. Thus, roughly, the total 


response time is two RTTs plus the transmission time at the server of the HTML file. 


HTTP with Persistent Connections 


Non-persistent connections have some shortcomings. 


1. 


Nv 


A brand-new connection must be established and maintained for each requested object. For 
each of these connections, TCP buffers must be allocated and TCP variables must be kept in 
both the client and server. This can place a significant burden on the Web server, which may 
be serving requests from hundreds of different clients simultaneously. 

Each object suffers a delivery delay of two RTTs— one RTT to establish the TCP connection 


and one RTT to request and receive an object. 


With persistent connections, the server leaves the TCP connection open after sending a response. 


Subsequent requests and responses between the same client and server can be sent over the same 


connection. In particular, an entire Web page can be sent over a single persistent TCP 


connection. Moreover, multiple Web pages residing on the same server can be sent from the 


server to the same client over a single persistent TCP connection. 


1.2.3 HTTP Message Format 
HTTP Request Message: 


Request line 


Header lines 


Blank line——— cr| if 


Entity body- 








method |sp URL sp| Version | cr | if 


























header field name: |sp] value | cr] If 








header field name: |sp| value | cr] if 



































Ly _J 
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Where sp — space, cr — carriage return and If — line feed. 
Method: 
There are five HTTP methods: 


GET: The GET method is used when the browser requests an object, with the requested 
object identified in the URL field. 

POST: With a POST message, the user is still requesting a Web page from the server, 
but the specific contents of the Web page depend on what the user entered into the form 
fields. If the value of the method field is POST, then the entity body contains what the 
user entered into the form fields. 

PUT: The PUT method is also used by applications that need to upload objects to Web 
servers. 

HEAD: Used to retrieve header information. It is used for debugging purpose. 
DELETE: The DELETE method allows a user, or an application, to delete an object on a 


Web server. 


URL: Specifies URL of the requested object 
Version: This field represents HTTP version, usually HTTP/1.1 


Header line: 


Ex: 


Host: www.someschool.edu 


Connection: close 


User-agent: Mozilla/5.0 


Accept-language: fr 


The header line Host:www.someschool.edu specifies the host on which the object resides. 


By including the Connection:close header line, the browser is telling the server that it doesn’t 


want to bother with persistent connections; it wants the server to close the connection after 


sending the requested object. 


The User-agent: header line specifies the user agent, that is, the browser type that is making the 


request to the server. Here the user agent is Mozilla/S.0, a Firefox browser. 





Archana R A, Asso. Prof., Dept. of CSE, SVIT 13 


Module — 1: Application Layer 





The Accept-language: header indicates that the user prefers to receive a French version of the 


object, if such an object exists on the server; otherwise, the server should send its default version. 


HTTP Response Message 





Status line. 





version |sp| statuscode |sp} phrase | cr| If 




















header field name: |sp| value | cr] If 





Header lines: 








header field name: |sp| value | cr| If 

















Blank line cr} If 














Entity body——]_ 











Ex: 








HTTP/1.1 200 OK 

Connection: close 

Date: Tue, 09 Aug 2011 15:44:04 GMT 

Server: Apache/2.2.3 (CentOS) 

Last-Modified: Tue, 09 Aug 2011 15:11:03 GMT 
Content-Length: 6821 

Content-Type: text/html 


(data data data data data ...) 





The status line has three fields: the protocol version field, a status code, and a corresponding 
status message. 

Version is HTTP/1.1 

The status code and associated phrase indicate the result of the request. Some common status 
codes and associated phrases include: 


+ 200 OK: Request succeeded and the information is returned in the response. 
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+ 301 Moved Permanently: Requested object has been permanently moved; the new URL is 
specified in Location: header of the response message. The client software will automatically 
retrieve the new URL. 
+ 400 Bad Request: This is a generic error code indicating that the request could not be 
understood by the server. 
+ 404 Not Found: The requested document does not exist on this server. 
+ 505 HTTP Version Not Supported: The requested HTTP protocol version is not supported by 
the server. 
Header fields: 
e The server uses the Connection: close header line to tell the client that it is going to close 
the TCP connection after sending the message. 
e The Date: header line indicates the time and date when the HTTP response was created 
and sent by the server. 
e The Server: header line indicates that the message was generated by an Apache Web 
server; it is analogous to the User-agent: header line in the HTTP request message. 
e The Last-Modified: header line indicates the time and date when the object was created 
or last modified. 
e The Content-Length: header line indicates the number of bytes in the object being sent. 


© The Content-Type: header line indicates that the object in the entity body is HTML text. 


1.2.4 User-Server Interaction: Cookies 

It is often desirable for a Web site to identify users, either because the server wishes to restrict 
user access or because it wants to serve content as a function of the user identity. For these 
purposes, HTTP uses cookies. 

Cookie technology has four components: 

(1) A cookie header line in the HTTP response message; 

(2) A cookie header line in the HTTP request message; 

(3) A cookie file kept on the user’s end system and managed by the user’s browser; 


(4) A back-end database at the Web site. 


Ex: 
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Suppose a user, who always accesses the Web using Internet Explorer from her home PC, 
contacts Amazon.com for the first time. Let us suppose that in the past he has already visited the 
eBay site. When the request comes into the Amazon Web server, the server creates a unique 
identification number and creates an entry in its back-end database that is indexed by the 
identification number. The Amazon Web server then responds to Susan’s browser, including in 
the HTTP response a Set-cookie: header, which contains the identification number. 

For example, the header line might be: 

Set-cookie: 1678 

When users browser receives the HTTP response message, it sees the Set-cookie: header. The 
browser then appends a line to the special cookie file that it manages. This line includes the 
hostname of the server and the identification number in the Set-cookie: header. 

As user continues to browse the Amazon site, each time he requests a Web page, his browser 
consults his cookie file, extracts his identification number for this site, and puts a cookie header 
line that includes the identification number in the HTTP request. Specifically, each of his HTTP 
requests to the Amazon server includes the header line: 


Cookie: 1678 


Client host Server host 


Be F 
B— 2 
Atep regy 
ebay: 8734 C8 mag 
— Server creates 


respons? 
neee © ee ID 1678 for user 
ysvat BEEP | 16 See entry in backend 
set-co database 
Waal he 
LCkig, 
sang = ge 


ebay: 8734 souk 
s — Cookie-specific <= >| 
onse_ SS! action 


: 


al hy ‘access 


ttp 
| oe 17 E°M CSE nag 


amazon: 1678 
— Cookie-specific 


ebay: 8734 
; eS action 
pai Be 


Time Time 


Dne week later. 
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1.2.5 Web Caching 

e A Web cache—also called a proxy server—is a network entity that satisfies HTTP requests 
on the behalf of an origin Web server. 

e The Web cache has its own disk storage and keeps copies of recently requested objects in this 
storage. 


e A.user’s browser can be configured so that all of the user’s HTTP requests are first directed 


to the Web cache. 





Sm Proxy - ot a 
server > < 
P— Suess ey) ae 
Client Ves et Origin 
One a server 
Pp, 
5 
Ue, 
4 ~ig 
ios 
‘Me 
Origin 
server 


Ex: Suppose a browser is requesting the object http://www.someschool.edu/campus.gif. Here 

is what happens: 

1. The browser establishes a TCP connection to the Web cache and sends an HTTP request 
for the object to the Web cache. 

2. The Web cache checks to see if it has a copy of the object stored locally. If it does, the 
Web cache returns the object within an HTTP response message to the client browser. 

3. If the Web cache does not have the object, the Web cache opens a TCP connection to the 
origin server, that is, to www.someschool.edu. The Web cache then sends an HTTP 
request for the object into the cache-to-server TCP connection. 

4. After receiving this request, the origin server sends the object within an HTTP response 
to the Web cache. 

5. When the Web cache receives the object, it stores a copy in its local storage and sends a 
copy, within an HTTP response message, to the client browser (over the existing TCP 


connection between the client browser and the Web cache). 
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When web cache receives requests from and sends responses to a browser, it is a server. 
When it sends requests to and receives responses from an origin server, it is a client. 
Typically a Web cache is purchased and installed by an ISP. For example, a university might 
install a cache on its campus network and configure all of the campus browsers to point to 
the cache. Or a major residential ISP (such as AOL) might install one or more caches in its 
network and pre configure its shipped browsers to point to the installed caches. 

Web caching has seen deployment in the Internet for two reasons. First, a Web cache can 
substantially reduce the response time for a client request. Second, Web caches can 


substantially reduce traffic on an institution’s access link to the Internet. 


1.2.6 The Conditional GET 


Ex: 


Although caching can reduce user-perceived response times, it introduces a new problem— 
the copy of an object residing in the cache may be stale. In other words, the object housed in 
the Web server may have been modified since the copy was cached at the client. 

HTTP has a mechanism that allows a cache to verify that its objects are up to date. This 
mechanism is called the conditional GET. 

An HTTP request message is a so-called conditional GET message if (1) the request message 
uses the GET method and (2) the request message includes an If-Modified- Since: header 


line. 


First, on the behalf of a requesting browser, a proxy cache sends a request message to a Web 


server: 





GET /fruit/kiwi.gif HTTP/1.1 


Host: www.exotiquecuisine.com 





Second, the Web server sends a response message with the requested object to the cache: 








HTTP/1.1 200 OK 

Date: Sat, 8 Oct 2011 15:39:29 

Server: Apache/1.3.0 (Unix) 
Last-Modified: Wed, 7 Sep 2011 09:23:24 
Content-Type: image/gif 

(data data data data data ...) 
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The cache forwards the object to the requesting browser but also caches the object locally. 
Importantly, the cache also stores the last-modified date along with the object. 

Third, one week later, another browser requests the same object via the cache, and the object is 
still in the cache. Since this object may have been modified at the Web server in the past week, 
the cache performs an up-to-date check by issuing a conditional GET. Specifically, the cache 


sends: 








GET /fruit/kiwi.gif HTTP/1.1 
Host: www.exotiquecuisine.com 


If-modified-since: Wed, 7 Sep 2011 09:23:24 





This conditional GET is telling the server to send the object only if the object has been modified 
since the specified date. 
Suppose the object has not been modified since 7 Sep 2011 09:23:24. Then, fourth, the Web 


server sends a response message to the cache: 








HTTP/1.1 304 Not Modified 
Date: Sat, 15 Oct 2011 15:39:29 
Server: Apache/1.3.0 (Unix) 
(empty entity body) 





We see that in response to the conditional GET, the Web server still sends a response message 


but does not include the requested object in the response message. 


1.3 File Transfer: FTP 


e FTP is used for transferring file from one host to another host. 

e In order for the user to access the remote account, the user must provide user identification 
and a password. After providing this authorization information, the user can transfer files 
from the local file system to the remote file system and vice versa. 

e The user first provides the hostname of the remote host, causing the FTP client process in the 
local host to establish a TCP connection with the FTP server process in the remote host. 

e The user then provides the user identification and password, which are sent over the TCP 


connection as part of FTP commands. 
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¢ Once the server has authorized the user, the user copies one or more files stored in the local 


file system into the remote file system (or vice versa). 











[ 
‘FTP user FTP File transfer 
©) gay = jo 
L LE 
User 
or host 


Local file Remote file 
system system 














! 


e FTP uses two parallel TCP connections to transfer a file, a control connection and a data 
connection. 

e The control connection is used for sending control information between the two hosts— 
information such as user identification, password, commands to change remote directory, and 
commands to “put” and “get” files. 

e The data connection is used to actually send a file. 


TCP control connection port 21 
ey 


ee 
TcP data connection port 20 


FIP FIP 
client server 


e When a user starts an FTP session with a remote host, the client side of FTP (user) first 
initiates a control TCP connection with the server side (remote host) on server port number 
21. 

e The client side of FTP sends the user identification and password over this control 
connection. The client side of FTP also sends, over the control connection, commands to 
change the remote directory. 

e When the server side receives a command for a file transfer over the control connection 
(either to, or from, the remote host), the server side initiates a TCP data connection to the 


client side. 
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e FTP sends exactly one file over the data connection and then closes the data connection. If, 
during the same session, the user wants to transfer another file, FTP opens another data 
connection. 

e Thus, with FTP, the control connection remains open throughout the duration of the user 


ion, but a new data connection is created for each file transferred within a s 





sion (that is, 
the data connections are non-persistent). 

e Throughout a session, the FTP server must maintain state about the user. In particular, the 
server must associate the control connection with a specific user account, and the server must 


keep track of the user’s current directory as the user wanders about the remote directory tree. 


1.3.1 FTP Commands and Replies 


Some of the more common commands are given below: 

* USER username: Used to send the user identification to the server. 

+ PASS password: Used to send the user password to the server. 

+ LIST: Used to ask the server to send back a list of all the files in the current remote directory. 
The list of files is sent over a (new and non-persistent) data connection rather than the control 
TCP connection. 

+ RETR filename: Used to retrieve (that is, get) a file from the current directory of the remote 
host. This command causes the remote host to initiate a data connection and to send the 
requested file over the data connection. 

+ STOR filename: Used to store (that is, put) a file into the current directory of the remote host. 
Each command is followed by a reply, sent from server to client. The replies are three-digit 
numbers, with an optional message following the number. 

+ 331 Username OK, password required 

* 125 Data connection already open; transfer starting 

+ 425 Can’t open data connection 


+ 452 Error writing file 


1.4 Electronic Mail in the Internet 


E-mail has three major components: user agents, mail servers, and the Simple Mail Transfer 


Protocol (SMTP). 
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e User agents allow users to read, reply to, forward, save, and compose messages. 

e Mail servers form the core of the e-mail infrastructure. Each recipient has a mailbox located 
in one of the mail servers. A typical message starts its journey in the sender’s user agent, 
travels to the sender’s mail server, and travels to the recipient’s mail server, where it is 
deposited in the recipient’s mailbox. 

e SMTP is the principal application-layer protocol for Internet electronic mail. It uses the 
reliable data transfer service of TCP to transfer mail from the sender’s mail server to the 
recipient’s mail server. As with most application-layer protocols, SMTP has two sides: a 
client side, which executes on the sender’s mail server, and a server side, which executes on 


the recipient’s mail server. 


1.4.1 SMTP 

SMTP transfers messages from senders’ mail servers to the recipients’ mail servers. It restricts 
the body (not just the headers) of all mail messages to simple 7-bit ASCII. 

Suppose Alice wants to send Bob a simple ASCII message. 

1. Alice invokes her user agent for e-mail, provides Bob’s e-mail address (for example, 


bob@someschool.edu), composes a message, and instructs the user agent to send the message. 
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2. Alice’s user agent sends the message to her mail server, where it is placed in a message queue. 
3. The client side of SMTP, running on Alice’s mail server, sees the message in the message 
queue. It opens a TCP connection to an SMTP server, running on Bob’s mail server. 

4. After some initial SMTP handshaking, the SMTP client sends Alice’s message into the TCP 
connection. 

5. At Bob’s mail server, the server side of SMTP receives the message. Bob’s mail server then 
places the message in Bob’s mailbox. 


6. Bob invokes his user agent to read the message at his convenience. 





“Alice's Bob's 
= mail server mail server 


bf ees oo SMTP fen & 


An example transcript of messages exchanged between an SMTP client (C) and an SMTP server 


(S). 





























S: 220 hamburger.edu 

C: HELO crepes.fr 

S: 250 Hello crepes.fr, pleased to meet you 

C: MAIL FROM: <alice @crepes.fr> 

S: 250 alice @crepes.fr ... Sender ok 

C: RCPT TO: <bob@hamburger.edu> 

S: 250 bob@hamburger.edu ... Recipient ok 

C: DATA 

S: 354 Enter mail, end with “.” ona line by itself 
C: Do you like ketchup? 

C: How about pickles? 
Cc 
Ss 


: 250 Message accepted for delivery 
C: QUIT 


S: 221 hamburger.edu closing connection 
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1.4.2 Comparison with HTTP 





HTTP 


SMTP 





Pull Protocol- someone loads information on a 
Web server and users use HTTP to pull the 
information from the server at their 


convenience. 


Push Protocol- the sending mail server pushes 


the file to the receiving mail server. 





HTTP does not mandates data to be in 7-bit 
ASCII format. 


SMTP requires each message, including the 
body of each message, to be in 7-bit ASCII 


format. 








HTTP encapsulates each object in its own 


HTTP response message. 





Internet mail places all of the message’s 


objects into one message. 





1.4.3 Mail Message Formats 


When an e-mail message is sent from one person to another, a header containing peripheral 


information precedes the body of the message. 


The header lines and the body of the message are separated by a blank line. 


Every header must have a From: header line and a To: header line; a header may include a 


Subject: header line as well as other optional header lines. 


A typical message header looks like this: 








From: alice @crepes.fr 
To: bob@hamburger.edu 


Subject: Searching for the meaning of life. 





1.4.4 Mail Access Protocols 


SMTP protocol delivers the mail to the mail server. To fetch the mail from mail server receiver 


used mail access protocols. 


There are currently a number of popular mail access protocols, including Post Office Protocol— 


Version 3 (POP3), Internet Mail Access Protocol (IMAP), and HTTP. 
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POP3 


POP3 is an extremely simple mail access protocol. 

POP3 begins when the user agent (the client) opens a TCP connection to the mail server (the 
server) on port 110. 

With the TCP connection established, POP3 progresses through three phases: authorization, 
transaction, and update. 

During the authorization phase, the user agent sends a username and a password to 
authenticate the user. 

During the transaction phase, the user agent retrieves messages; also during this phase, the 
user agent can mark messages for deletion, remove deletion marks, and obtain mail statistics. 

The update phase occurs after the client has issued the quit command, ending the POP3 
session; at this time, the mail server deletes the messages that were marked for deletion. 

In a POP3 transaction, the user agent issues commands, and the server responds to each 
command with a reply. There are two possible responses: +OK used by the server to indicate 
that the previous command was fine; and -ERR, used by the server to indicate that something 
was wrong with the previous command. 

The authorization phase has two principal commands: user <username> and pass 


<password>. 








user bob 
+OK 
pass hungry 


+OK user successfully logged on 





Ex: 


A user agent using POP3 can often be configured (by the user) to “download and delete” or 
to “download and keep.” 


In the download-and-delete mode, the user agent will issue the list, retr, and delecommands. 





C: list 
S: 1 498 
S:2912 


S:. 
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C: retr 1 
S: (blah blah ... 





:retr 2 





: dele 2 
: quit 
: +OK POP3 server signing off 


PAALeEBYAAL 


e A problem with this download-and-delete mode is that the recipient cannot access mail 
messages from multiple machines. 
e In the download-and keep mode, the user agent leaves the messages on the mail server after 


downloading them. In this case, user can reread messages from different machines. 
IMAP 


e With POP3 access, once user has downloaded his messages to the local machine, he can 
create mail folders and move the downloaded messages into the folders. User can then delete 
messages, move messages across folders, and search for messages (by sender name or 
subject). 

e But this paradigm—namely, folders and messages in the local machine—poses a problem for 
the nomadic user, who would prefer to maintain a folder hierarchy on a remote server that 
can be accessed from any computer. This is not possible with POP3—the POP3 protocol 
does not provide any means for a user to create remote folders and assign messages to 
folders. 

© To solve this and other problems, the IMAP protocol was invented. Like POP3, IMAP is a 
mail access protocol. It has many more features than POP3, but it is also significantly more 


complex. 
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An IMAP server will associate each message with a folder; when a message first arrives at 
the server, it is associated with the recipient’s INBOX folder. 

The recipient can then move the message into a new, user-created folder, read the message, 
delete the message, and so on. 

The IMAP protocol provides commands to allow users to create folders and move messages 
from one folder to another. 

IMAP also provides commands that allow users to search remote folders for messages 
matching specific criteria. 

Another important feature of IMAP is that it has commands that permit a user agent to obtain 
components of messages. For example, a user agent can obtain just the message header of a 
message or just one part of a multipart MIME message. This feature is useful when there is a 
low-bandwidth connection (for example, a slow-speed modem link) between the user agent 
and its mail server. With a low bandwidth connection, the user may not want to download all 
of the messages in its mailbox, particularly avoiding long messages that might contain, for 


example, an audio or video clip. 


Web-Based E-Mail 


More and more users today are sending and accessing their e-mail through their Web browsers. 


In this case user communicates with its remote mailbox via HTTP. 


1.5 DNS—The Internet’s Directory Service 


All the hosts connected to network is identified by IP address. But it is difficult for human 
beings to remember these IP address to access a particular host. Hence hosts are identified by 
hostnames. Ex: google.com 

But the routers require IP address to forward the packet. 


In order to map hostname with the IP address DNS is used. 


1.5.1 Services Provided by DNS 


The DNS is (1) a distributed database implemented in a hierarchy of DNS servers, and (2) an 


application-layer protocol that allows hosts to query the distributed database. 
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DNS is commonly employed by other application-layer protocols—including HTTP, SMTP, 


and FTP—to translate user-supplied hostnames to IP addresses. 








Example: 


Consider what happens when a browser running on some user’s host, requests the URL 


www.someschool.edu/index.html. 


In order for the user’s host to be able to send an HTTP request message to the Web server 


www.someschool.edu, the user’s host must first obtain the IP address of www.someschool.edu. 


This is done as follows. 


woe 


The same user machine runs the client side of the DNS application. 

The browser extracts the hostname, www.someschool.edu, from the URL and passes the 
hostname to the client side of the DNS application. 

The DNS client sends a query containing the hostname to a DNS server. 


The DNS client eventually receives a reply, which includes the IP address for the hostname. 





Once the browser receives the IP address from DNS, it can initiate a TCP connection to the 


HTTP server process located at port 80 at that IP address. 





DNS provides a few other important services in addition to translating hostnames to IP 


addresses: 


Host aliasing: A host with a complicated hostname can have one or more alias names. For 
example, a hostname such as relay1.west-coast.enterprise.com could have, say, two aliases 
such as enterprise.com and www.enterprise.com. In this case, the hostname relay 1.westcoast. 
enterprise.com is said to be a canonical hostname. Alias hostnames, when present, are 
typically more mnemonic than canonical hostnames. DNS can be invoked by an application 
to obtain the canonical hostname for a supplied alias hostname as well as the IP address of 
the host. 


Mail server aliasing: For obvious reasons, it is highly desirable that e-mail addresses be 





mnemonic. For example, if Bob has an account with Hotmail, Bob’s e-mail address might be 
as simple as bob@hotmail.com. However, the hostname of the Hotmail mail server is more 


complicated and much less mnemonic than simply hotmail.com (for example, the canonical 
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hostname might be something like relay1.west-coast.hotmail.com). DNS can be invoked by a 
mail application to obtain the canonical hostname for a supplied alias hostname as well as the 
TP address of the host. 

e Load distribution: DNS is also used to perform load distribution among replicated servers, 
such as replicated Web servers. Busy sites, such as cnn.com, are replicated over multiple 


servers, with each server running on a different end system and each having a different IP 





address. For replicated Web servers, a set of IP addresses is thus associated with one 
canonical hostname. The DNS database contains this set of IP addresses. When clients make 
a DNS query for a name mapped to a set of addresses, the server responds with the entire set 
of IP addresses, but rotates the ordering of the addresses within each reply. Because a client 
typically sends its HTTP request message to the IP address that is listed first in the set, DNS 


rotation distributes the traffic among the replicated servers. 
1.5.2 Overview of How DNS Works 


e Suppose that some application running in a user’s host needs to translate a hostname to an IP 
address. The application will invoke the client side of DNS, specifying the hostname that 
needs to be translated. 

e DNS in the user’s host then takes over, sending a query message into the network. 

e All DNS query and reply messages are sent within UDP datagrams to port 53. After a delay, 
ranging from milliseconds to seconds, DNS in the user’s host receives a DNS reply message 


that provides the desired mapping. This mapping is then passed to the invoking application. 


In this centralized design, clients simply direct all queries to the single DNS server, and the DNS 
server responds directly to the querying clients. Although the simplicity of this design is 
attractive, it is inappropriate for today’s Internet, with its vast (and growing) number of hosts. 


The problems with a centralized design include: 


e Asingle point of failure. If the DNS server crashes, so does the entire Internet! 
e Traffic volume. A single DNS server would have to handle all DNS queries. 
e Distant centralized database. A single DNS server cannot be “close to” all the querying 


clients. If we put the single DNS server in New York City, then all queries from Australia 
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must travel to the other side of the globe, perhaps over slow and congested links. This can 
lead to significant delays. 

e Maintenance. The single DNS server would have to keep records for all Internet hosts. Not 
only would this centralized database be huge, but it would have to be updated frequently to 


account for every new host. 
A Distributed, Hierarchical Database 


e In order to deal with the issue of scale, the DNS uses a large number of servers, organized in 
a hierarchical fashion and distributed around the world. 
e There are three classes of DNS servers—root DNS servers, top-level domain (TLD) DNS 


servers, and authoritative DNS servers—organized in a hierarchy. 











Root DNS servers 
com DNS servers org DNS servers edu DNS servers 
yahoo.com amazon.com pbs.org poly.edu umass.edu 
DNS servers DNS servers DNS servers DNS servers DNS servers 


e Root DNS servers. In the Internet there are 13 root DNS servers (labeled A through M), 
most of which are located in North America. 
Although we have referred to each of the 13 root DNS servers as if it were a single server, 
each “server” is actually a network of replicated servers, for both security and reliability 
purposes. All together, there are 247 root servers. 

¢ Top-level domain (TLD) servers: These servers are responsible for top-level domains such 
as com, org, net, edu, and gov, and all of the country top-level domains such as in,uk, fr, ca. 

e Authoritative DNS servers: Every organization with publicly accessible hosts on the 
Internet must provide publicly accessible DNS records that map the names of those hosts to 
TP addresses. An organization’s authoritative DNS server houses these DNS records. 

e There is another important type of DNS server called the local DNS server. A local DNS 


server does not strictly belong to the hierarchy of servers but is nevertheless central to the 
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DNS architecture. Each ISP—such as a university, an academic department, an employee’s 
company, or a residential ISP—has a local DNS server. 


Two type of Interaction: 


1) Recursive Queries: 


Root DNS server 


\il 


‘ee. 





Local ONS server TLD DNS server 


fs 


Requesting host 





Here DNS query is sent to local DNS server then to root server, then to TLD server and finally 


to authoritative DNS server. DNS response arrives in the reverse order. 
2) Iterative Queries: 


Root DNS server 


ls 
Tl 


Local fi server LD DNS server 
dns .pely.edu 


oN 


Authoritative DNS server 
dns.umass.edu 


Requesting host 
cis-poly-edu 


gaia.cs-umass.edu 
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Here DNS query will be sent to Local DNS server, then to root server. Root server sends the IP 
address of TLD server. Now local DNS server sends query to TLD DNS server. TLD DNS 
server sends the IP address of authoritative DNS server to local DNS server. Now Local DNS 
server sends query to authoritative DNS server. Authoritative DNS server sends the IP address of 


host to local DNS server. Local DNS server sends it to the host. 


DNS Caching 

In a query chain, when a DNS server receives a DNS reply it can cache the mapping in its local 
memory. 

If a hostname/IP address pair is cached in a DNS server and another query arrives to the 

DNS server for the same hostname, the DNS server can provide the desired IP address, even if it 


is not authoritative for the hostname. Because hos 





ind mappings between hostnames and IP 
addresses are by no means permanent, DNS servers discard cached information after a period of 


time (often set to two days). 


1.5.3 DNS Records and Messages 
The DNS servers that together implement the DNS distributed database store resource records 
(RRs). 
A resource record is a four-tuple that contains the following fields: 
(Name, Value, Type, TTL) 
TTL is the time to live of the resource record; it determines when a resource should be removed 
from a cache. 
The meaning of Name and Value depend on Type: 
e If Type=A, then Name is a hostname and Value is the IP address for the hostname. 
e If Type=NS, then Name is a domain (such as foo.com) and Value is the hostname of an 
authoritative DNS server that knows how to obtain the IP addresses for hosts in the domain. 
e If Type=CNAME, then Value is a canonical hostname for the alias hostname Name. This 
record can provide querying hosts the canonical name for a hostname. 
e If Type=MX, then Value is the canonical name of a mail server that has an alias hostname 


Name. 
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DNS Messages 


Identification Flags 
Number of questions Number of answer RRs [-12 bytes 
Number of authority RRs Number of additional RRs 


Questions L ‘ 
(variable number of questions) —— fields for 


Answers - 
(variable number of resource records) insect a! 


Authority |-Records for 
(variable number of resource records) authoritative servers 
Additional information |_Additional “helpful” 


(variable number of resource records) 





info that may be used 


e The first 12 bytes is the header section, which has a number of fields. 

e The first field is a 16-bit number that identifies the query. This identifier is copied into the 
reply message to a query, allowing the client to match received replies with sent queries. 

e = There are a number of flags in the flag field. 

A 1-bit query/reply flag indicates whether the message is a query (0) or a reply (1). Al-bit 
authoritative flag is set in a reply message when a DNS server is an authoritative server for a 
queried name. 

A 1-bit recursion-desired flag is set when a client (host or DNS server) desires that the DNS 
server perform recursion when it doesn’t have the record. 

A 1-bit recursion available field is set in a reply if the DNS server supports recursion. 

e In the header, there are also four number-of fields. These fields indicate the number of 
occurrences of the four types of data sections that follow the header. 

e The question section contains information about the query that is being made. This section 
includes (1) a name field that contains the name that is being queried, and (2) a type field that 
indicates the type of question being asked about the name 

e Inareply from a DNS server, the answer section contains the resource records for the name 


that was originally queried. 
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e The authority section contains records of other authoritative servers. 


e The additional section contains other helpful records. 
Inserting Records into the DNS Database 


Suppose you have just created an exciting new startup company called Network Utopia. The first 
thing you’ll surely want to do is register the domain name networkutopia.com at a registrar. A 
registrar is a commercial entity that verifies the uniqueness of the domain name, enters the 
domain name into the DNS database (as discussed below), and collects a small fee from you for 


its services. 


For the primary authoritative server for networkutopia.com, the registrar would insert the 


following two resource records into the DNS system: 
(networkutopia.com, dns1.networkutopia.com, NS) 


(dns1.networkutopia.com, 212.212.212.1, A) 
1.6 Peer-to-Peer Applications 


In P2P architecture, there is minimal (or no) reliance on always-on infrastructure servers. 
Instead, pairs of intermittently connected hosts, called peers, communicate directly with each 


other. 
1.6.1 P2P File Distribution 


e In P2P file distribution, each peer can redistribute any portion of the file it has received to 
any other peers, thereby assisting the server in the distribution process. 


e The most popular P2P file distribution protocol is BitTorrent. 


Scalability of P2P Architectures 

As shown in below Figure the server and the peers are connected to the Internet with access 
links. Denote the upload rate of the server’s access link by us, the upload rate of the ith peer’s 
access link by ui, and the download rate of the ith peer’s access link by dj. Also denote the size of 
the file to be distributed (in bits) by F and the number of peers that want to obtain a copy of the 
file by N. 
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The distribution time is the time it takes to get a copy of the file to all N peers. 


File: F =n 






Internet 





In the client-server architecture, none of the peers aids in distributing the file. We make the 


following observations: 


e The server must transmit one copy of the file to each of the N peers. Thus the server must 
transmit NF bits. Since the server’s upload rate is us, the time to distribute the file must be at 
least NF/us. 

e Let dmin denote the download rate of the peer with the lowest download rate, that is, dmin = 
min{d1,dp,....dN}. The peer with the lowest download rate cannot obtain all F bits of the file 
in less than F/dmin seconds. Thus the minimum distribution time is at least F/dmin. 

Putting these two observations together, we obtain 
De. = maf Fh 
Ca een 

In the P2P architecture we make the following observations: 

e At the beginning of the distribution, only the server has the file. To get this file into the 
community of peers, the server must send each bit of the file at least once into its access link. 
Thus, the minimum distribution time is at least F/us. 

e As with the client-server architecture, the peer with the lowest download rate cannot obtain 
all F bits of the file in less than F/dmin seconds. Thus the minimum distribution time is at 


least F/dmin. 
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e Finally, observe that the total upload capacity of the system as a whole is equal to the upload 
rate of the server plus the upload rates of each of the individual peers, that is, utotal = us + ul 
+... + uN. The system must deliver (upload) F bits to each of the N peers, thus delivering a 
total of NF bits. This cannot be done at a rate faster than utotal. Thus, the minimum 
distribution time is also at least NF/(us + ul + ... + uN). 

Putting these three observations together, we obtain the minimum distribution time for P2P, 

denoted by Dp2p. 

i AK NF 

N 


Dui 
= 





Dpy 





= max 4 Us dmin 
us + 


Below Figure compares the minimum distribution time for the client-server and P2P 


architectures assuming that all peers have the same upload rate u. 





Client-Server 








BitTorrent 

e In BitTorrent, the collection of all peers participating in the distribution of a particular file is 
called a torrent. 

e Peers in a torrent download equal-size chunks of the file from one another, with a typical 
chunk size of 256 KBytes. 

e¢ When a peer first joins a torrent, it has no chunks. Over time it accumulates more and more 
chunks. While it downloads chunks it also uploads chunks to other peers. 

¢ Once a peer has acquired the entire file, it may leave the torrent, or remain in the torrent and 
continue to upload chunks to other peers. 

e Also, any peer may leave the torrent at any time with only a subset of chunks, and later rejoin 


the torrent. 
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e Each torrent has an infrastructure node called a tracker. 

e When a peer joins a torrent, it registers itself with the tracker and periodically informs the 
tracker that it is still in the torrent. In this manner, the tracker keeps track of the peers that are 
participating in the torrent. 

e When a new peer joins the torrent, the tracker randomly selects a subset of peers (for 
concreteness, say 50) from the set of participating peers, and sends the IP addresses of these 
50 peers to new peer. 

e Possessing this list of peers, new peer attempts to establish concurrent TCP connections with 
all the peers on this list. All the peers with which new peer succeeds in establishing a TCP 
connection will be called as “neighboring peers.” 

e As time evolves, some of these peers may leave and other peers (outside the initial 50) may 
attempt to establish TCP connections. 

e Periodically, peer will ask each of its neighboring peers (over the TCP connections) for the 
list of the chunks they have. If peer has L different neighbors, it will obtain L lists of chunks. 
With this knowledge, peer will issue requests (again over the TCP connections) for chunks 
currently it does not have. 

e In deciding which chunks to request, peer uses a technique called rarest first. The idea is to 
determine, from among the chunks peer does not have, the chunks that are the rarest among 
its neighbors and then request those rarest chunks first. In this manner, the rarest chunks get 
more quickly redistributed, aiming to equalize the numbers of copies of each chunk in the 
torrent. 


Tracker 
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To determine which requests peer responds to, BitTorrent uses a clever trading algorithm. 
The basic idea is that peer gives priority to the neighbors that are currently supplying data to 
it at the highest rate. Specifically, for each of its neighbors, peer continually measures the 
rate at which it receives bits and determines the four peers that are feeding bits at the highest 
rate. Peer then reciprocates by sending chunks to these same four peers. 

Every 10 seconds, peer recalculates the rates and possibly modifies the set of four peers. 

In BitTorrent lingo, these four peers are said to be unchoked. 

Importantly, every 30 seconds, peer also picks one additional neighbor at random and sends 
it chunks. In BitTorrent lingo, this randomly selected peer is said to be optimistically 
unchoked. 

The random neighbor selection also allows new peers to get chunks, so that they can have 
something to trade. 


The incentive mechanism for trading just described is often referred to as tit-for-tat. 


1.6.2 Distributed Hash Tables (DHTs) 


Centralized version of this simple database will simply contain (key, value) pairs. We query 
the database with a key. If there are one or more key-value pairs in the database that match 
the query key, the database returns the corresponding values. 

Building such a database is straightforward with client-server architecture that stores all the 
(key, value) pairs in one central server. 

P2P version of this database will store the (key, value) pairs over millions of peers. 

In the P2P system, each peer will only hold a small subset of the totality of the (key, value) 
pairs. We’ll allow any peer to query the distributed database with a particular key. The 
distributed database will then locate the peers that have the corresponding (key, value) pairs 
and return the key-value pairs to the querying peer. 

Any peer will also be allowed to insert new key-value pairs into the database. Such a 
distributed database is referred to as a distributed hash table (DHT). 

One naive approach to building a DHT is to randomly scatter the (key, value) pairs across all 


the peers and have each peer maintain a list of the IP addresses of all participating peers. In 





Archana R A, Asso. Prof., Dept. of CSE, SVIT 38 


Module — 1: Application Layer 





this design, the querying peer sends its query to all other peers, and the peers containing the 
(key, value) pairs that match the key can respond with their matching pairs. 

e Such an approach is completely unscalable as it would require each peer to know about all 
other peers and have each query sent to all peers. 

e An elegant approach to designing a DHT is to first assign an identifier to each peer, where 
each identifier is an integer in the range [0, 2"-1] for some fixed n. 

e = This also require each key to be an integer in the same range. 

e To create integers out of such keys, we will use a hash function that maps each key (e.g., 
social security number) to an integer in the range [0, 2"-1]. 

Problem of storing the (key, value) pairs in the DHT: 

e The central issue here is defining a rule for assigning keys to peers. Given that each peer has 
an integer identifier and that each key is also an integer in the same range, a natural approach 
is to assign each (key, value) pair to the peer whose identifier is the closest to the key. 

e To implement such a scheme, let’s define the closest peer as the closest successor of the key. 

e¢ Now suppose a peer, Alice, wants to insert a (key, value) pair into the DHT. Conceptually, 
this is straightforward: She first determines the peer whose identifier is closest to the key; she 
then sends a message to that peer, instructing it to store the (key, value) pair. 

e If Alice were to keep track of all the peers in the system (peer IDs and corresponding IP 
addresses), she could locally determine the closest peer. But such an approach requires each 
peer to keep track of all other peers in the DHT—which is completely impractical for a large- 
scale system with millions of peers. 

Circular DHT 

To address this problem of scale, let’s now consider organizing the peers into a circle. In this 

circular arrangement, each peer only keeps track of its immediate successor and immediate 


predecessor (modulo 2"). 


Who is 
responsible 
Tor key 11? 


3 
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Each peer is only aware of its immediate successor and predecessor; for example, peer 5 knows 
the IP address and identifier for peers 8 and 4 but does not necessarily know anything about any 
other peers that may be in the DHT. 

Now suppose that peer 3 wants to determine which peer in the DHT is responsible for key 11. 
Using the circular overlay, the origin peer (peer 3) creates a message saying “Who is responsible 
for key 11?” and sends this message clockwise around the circle. Whenever a peer receives such 


a message, because it knows the identifier of its successor and predecessor, it can determine 





whether it is responsible for (that is, closest to) the key in question. If a peer is not responsible 
for the key, it simply sends the message to its successor. So, for example, when peer 4 receives 
the message asking about key 11, it determines that it is not responsible for the key (because its 
successor is closer to the key), so it just passes the message along to peer 5. This process 
continues until the message arrives at peer 12, who determines that it is the closest peer to key 
11. At this point, peer 12 can send a message back to the querying peer, peer 3, indicating that it 
is responsible for key 11. 

Although each peer is only aware of two neighboring peers, to find the node responsible for a 
key (in the worst case), all N nodes in the DHT will have to forward a message around the circle; 
N/2 messages are sent on average. 

Shortcuts are used to expedite the routing of query messages. Specifically, when a peer receives 
a message that is querying for a key, it forwards the message to the neighbor (successor neighbor 


or one of the shortcut neighbors) which is the closet to the key. 


10 
8 


When peer 4 receives the message asking about key 11, it determines that the closet peer to the 
key (among its neighbors) is its shortcut neighbor 10 and then forwards the message directly to 


peer 10. Clearly, shortcuts can significantly reduce the number of messages used to process a 


query. 
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Peer Churn 

In P2P systems, a peer can come or go without warning. Thus, when designing a DHT, we also 
must be concerned about maintaining the DHT overlay in the presence of such peer churn. 

To handle peer churn, we will now require each peer to track its first and second successors; for 
example, peer 4 now tracks both peer 5 and peer 8. We also require each peer to periodically 
verify that its two successors are alive 

Let’s now consider how the DHT is maintained when a peer abruptly leaves. For example, 
suppose peer 5 in above figure abruptly leaves. In this case, the two peers preceding the departed 
peer (4 and 3) learn that 5 has departed, since it no longer responds to ping messages. Peers 4 
and 3 thus need to update their successor state information. Let’s consider how peer 4 updates its 
state: 

1. Peer 4 replaces its first successor (peer 5) with its second successor (peer 8). 

2. Peer 4 then asks its new first successor (peer 8) for the identifier and IP address of its 
immediate successor (peer 10). Peer 4 then makes peer 10 its second successor. 

Let’s say a peer with identifier 13 wants to join the DHT, and at the time of joining, it only 
knows about peer 1’s existence in the DHT. Peer 13 would first send peer 1 a message, saying 
“what will be 13’s predecessor and successor?” This message gets forwarded through the DHT 
until it reaches peer 12, who realizes that it will be 13’s predecessor and that its current 
successor, peer 15, will become 13’s successor. Next, peer 12 sends this predecessor and 
successor information to peer 13. Peer 13 can now join the DHT by making peer 15 its successor 


and by notifying peer 12 that it should change its immediate successor to 13. 


1.7 Socket Programming: Creating Network Applications 

eA typical network application consists of a pair of programs—a client program and a server 
program—residing in two different end systems. 

e When these two programs are executed, a client process and a server process are created, and 
these processes communicate with each other by reading from, and writing to, sockets. 

e When creating a network application, the developer’s main task is therefore to write the code 


for both the client and server programs. 
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1.7.1 Socket Programming with UDP 

Before the sending process can push a packet of data out the socket door, when using UDP, it 
must first attach a destination address to the packet. After the packet passes through the sender’s 
socket, the Internet will use this destination address to route the packet through the Internet to the 
socket in the receiving process. When the packet arrives at the receiving socket, the receiving 
process will retrieve the packet through the socket, and then inspect the packet’s contents and 


take appropriate action. 


Example application: 

1. The client reads a line of characters (data) from its keyboard and sends the data to the server. 
2. The server receives the data and converts the characters to uppercase. 

3. The server sends the modified data to the client. 


4. The client receives the modified data and displays the line on its screen. 


Server Client 
(Running on serverIP) 





Create socket, port=x: Create socket: 
serverSocket = clientSocket = 
socket (AF_INET,SOCK_DGRAM) Socket (AF_INET, SOCK_DGRAM) 
| | 
Create daaan with serverlP 
—— and port=x; 
Read UDP segment from send datagram via 
aoe elientSocket 
Write reply to 1 
serverSocket 
specifying client address, —_—— Read datagram from 
eae elientSocket 
Y 
Close 
clientSocket 
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UDPClient.py 


Here is the code for the client side of the application: 





from socket import * 

serverName = ‘hostname’ 

serverPort = 12000 

clientSocket = socket(socket.AF_INET, socket.SOCK_DGRAM) 
message = raw_input(’ Input lowercase sentence:’) 
clientSocket.sendto(message,(serverName, serverPort)) 
modifiedMessage, serverAddress = clientSocket.recvfrom(2048) 
print modifiedMessage 


clientSocket.close() 





UDPServer.py 








from socket import * 

serverPort = 12000 

serverSocket = socket(AF_INET, SOCK_DGRAM) 

serverSocket.bind((’’, serverPort)) 

print ”The server is ready to receive” 

while 1: 
message, clientAddress = serverSocket.recvfrom(2048) 
modifiedMessage = message.upper() 


serverSocket.sendto(modifiedMessage, clientAddress) 





1.7.2 Socket Programming with TCP 


e¢ Unlike UDP, TCP is a connection-oriented protocol. This means that before the client and 
server can start to send data to each other, they first need to handshake and establish a TCP 


connection. 
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© One end of the TCP connection is attached to the client socket and the other end is attached 


to a server socket. 


e When creating the TCP connection, we associate with it the client socket address (IP address 


and port number) and the server socket address (IP address and port number). With the TCP 


connection established, when one side wants to send data to the other side, it just drops the 


data into the TCP connection via its socket. This is different from UDP, for which the server 


must attach a destination address to the packet before dropping it into the socket. 


e During the three-way handshake, the client process knocks on the welcoming door of the 


server process. When the server “hears” the knocking, it creates a new door— more 


precisely, a new socket that is dedicated to that particular client. 


iS 


Client process Server process 





Welcoming 
socket 


\ cere 


\ socket 


TCPClient.py 








from socket import * 

serverName = ’servername’ 

serverPort = 12000 

clientSocket = socket(AF_INET, SOCK_STREAM) 
clientSocket.connect((serverName,serverPort)) 
sentence = raw_input(‘Input lowercase sentence:’) 
clientSocket.send(sentence) 

modifiedSentence = clientSocket.recv(1024) 

print ‘From Server:’, modifiedSentence 


clientSocket.close() 
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server Client 
(Running on servertP) 


Greate socket, port=, 
tor incoming request: 
serversocket = 








socket () 
————> 
, 
Wait for incoming, Led Create socket, connect 
connection request: Connection setup 49 serveriP, port=x: 
connectionSocket = elientSocket = 
server Socket .accept() socket () 
, 
Send request using 
Read requestfrom spray 
connectionsocket 
¥. | 
SIGE Read reply from 
connectionSocket rer) clientsochet 
* { 
_— oe Close 
connectionsocket, clientsocket 
TCPServer.py 








from socket import * 

serverPort = 12000 

serverSocket = socket(AF_INET,SOCK_STREAM) 
serverSocket.bind((‘’,serverPort)) 
serverSocket.listen(1) 

print ‘The server is ready to receive’ 

while 1: 

connectionSocket, addr = serverSocket.accept() 
sentence = connectionSocket.recv(1024) 
capitalizedSentence = sentence.upper() 
connectionSocket.send(capitalizedSentence) 


connectionSocket.close() 
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Module — 5 


MULTIMEDIA NETWORKING 


Multimedia Networking Applications 
> Properties of Video 


Most salient characteristic of video is its high bit rate. 

= Video distributed over the Internet typically ranges from 100,kbps for low-quality video 
conferencing to over 3 Mbps for streaming high-definition movies. 

= Video streaming consumes most bandwidth, having a bitrate of more than ten times 
greater than that of the normal HTTP and music-streaming applications. 

Video can be compressed. 

= A video is a sequence of images, typically being displayed at a.constant rate, for example, 
at 24 or 30 images per second. 

= An uncompressed, digitally encoded image consists of amarray of pixels, with each pixel 
encoded into a number of bits to represent luminance and color. 

= There are two types of redundaney in video, both of which can be exploited by video 
compression. 

= Spatial redundancy, is the,redundancy within a given image. Intuitively, an image that 
consists of mostly white,spaceyhasya high degree of redundancy and can be efficiently 
compresséd,without significantly sacrificing image quality. 

= Temporal redundancy reflects repetition from image to subsequent image. If, for 
example, an image and the subsequent image are exactly the same, there is no reason to 
reencode, the subsequent image; it is instead more efficient simply to indicate during 
encoding that the subsequent image is exactly the same. 

= We canyalso use compression to create multiple versions of the same video, each at a 
different quality level. For example, we can use compression to create, say, three versions 


of the same video, at rates of 300 kbps, 1 Mbps, and 3 Mbps. 


> Properties of Audio 


Digital audio has significantly lower bandwidth requirements than video. 
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Analog audio can be converted to a digital signal using pulse code modulation with the 

following steps: 

= The analog audio signal is sampled at some fixed rate. 

= Each of the samples is then rounded to one of a finite number of values. This operation is 
referred to as quantization. The number of such finite values called quantization values. 

= Each of the quantization values are encoded by representing with a fixed number of bits. 

PCM-encoded speech and music, however, are rarely used in the Internet, Instead, as with 

video, compression techniques are used to reduce the bit rates of the stream. 

A popular compression technique for near CD-quality stereo)music is MPEG.1 layer 3, more 

commonly known as MP3. 

MP3 encoders can compress to many different rates; 128 kbps.is the most common encoding 

rate and produces very little sound degradation. 

As with video, multiple versions of a prerecorded audio stream can be created, each at a 


different bit rate. 


> Types of Multimedia Network Applications 


Multimedia applications are classified,into three broad)categories: 


(i) Streaming stored audio/video 


(ii) Conversational yoice/video-over-IP 


(iii) Streaming live audio/video 


1) Streaming Stored Audio and, Video 


In this ¢lass of applications, the underlying medium is prerecorded video, such as a movie, a 
television show ,a prerecorded sporting event, or a prerecorded user generated video (such as 
those commonly seen on YouTube). 

These prerecorded videos are placed on servers, and users send requests to the servers to 
view the videos on demand. 

Many Internet companies today provide streaming video, including YouTube (Google), 
Netflix, and Hulu. 

By some estimates, streaming stored video makes up over 50 percent of the downstream 


traffic in the Internet access networks today. 
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Streaming stored video has three key distinguishing features. 


e Streaming: In a streaming stored video application, the client typically begins video 
playout within a few seconds after it begins receiving the video from the server. This 
means that the client will be playing out from one location in the video while at the same 
time receiving later parts of the video from the server. This technique, known as 


streaming, avoids having to download the entire video file before playout begins. 





¢ Interactivity: Because the media is prerecorded, the user may pause, reposition forward, 
reposition backward, fast-forward, and so on through the video)content. The time from 
when the user makes such a request until the action manifests itself at the,client should be 
less than a few seconds for acceptable responsiveness. 

¢ Continuous playout: Once playout of the video begins, it should proceed according to 
the original timing of the recording. Therefore, data must be received from the server in 


time for its playout at the client; otherwise, users. experience)yideo frame freezing or 


frame skipping. 


2) Conversational Voice- and Video-oyer-IP 


Real-time conversational voice)over the Internetiis often referred to as Internet telephony. It 
is also commonly called Voice-over=IP (VoIP). 

Conversational video is similar)except that it includes the video of the participants as well as 
their voices. 

Most of today’s,voice and video,conversational systems allow users to create conferences 
with three or more participants. 

Conyersational voice and video are widely used in the Internet today, with the Internet 
companies Skype, QQ; and Google Talk boasting hundreds of millions of daily users. 

Timing considerations and tolerance of data loss are important for conversational voice and 
video applications. 

Timing considerations are important because audio and video conversational applications are 
highly delay-sensitive. For a conversation with two or more interacting speakers, the delay 
from when a user speaks or moves until the action is manifested at the other end should be 


less than a few hundred milliseconds. 
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On the other hand, conversational multimedia applications are loss-tolerant— occasional loss 
only causes occasional glitches in audio/video playback, and these losses can often be 


partially or fully concealed. 


3) Streaming Live Audio and Video 


This third class of applications is similar to traditional broadcast radio and television, except 
that transmission takes place over the Internet. 

These applications allow a user to receive a live radio or television transmission—such as a 
live sporting event or an ongoing news event—transmitted from any corner of the world. 
Today, thousands of radio and television stations around the world are broadcasting content 
over the Internet. 

Live, broadcast-like applications often have many,users who receive the same audio/video 
program at the same time. 

Although the distribution of live audio/video to many receivers can be efficiently 
accomplished using the IP multicasting techniques, multicast distribution is more often 
accomplished today via application-layer multicast (using P2P networks or CDNs) or 
through multiple separateyunicast streams» 

As with streaming stored multimedia, the network must provide each live multimedia flow 
with an averagethroughput,that is,larger than the video consumption rate. Because the event 
is live, delay.can also be an issue, although the timing constraints are much less stringent 


than those for conyersational voice. 


Streaming Stored, eo 


For streaming video applications, prerecorded videos are placed on servers, and users send 
requests to these servers to view the videos on demand. 

The user may watch the video from beginning to end without interruption, may stop 
watching the video well before it ends, or interact with the video by pausing or repositioning 
to a future or past scene. 

Streaming video systems can be classified into three categories: 


1. UDP streaming 
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2. HTTP streaming 
3. Adaptive HTTP streaming. 


A common characteristic of all three forms of video streaming is the extensive use of client- 
side application buffering to mitigate the effects of varying end-to-end delays and varying 
amounts of available bandwidth between server and client. 

When the video starts to arrive at the client, the client need not immediately begin playout, 
but can instead build up a reserve of video in an application buffer. Once the client has built 
up a reserve of several seconds of buffered-but-not-yet-played video, the,client can then 
begin video playout. 

There are two important advantages provided by such client buffering. First, client side 
buffering can absorb variations in server-to-client delay. Second, if the server-to-client 
bandwidth briefly drops below the video consumption rate, a use, can continue to enjoy 
continuous playback, again as longyas the client application buffer does not become 


completely drained. 


> UDP Streaming 


With UDP streaming, the server transmits video at a rate that matches the client’s video 
consumption rate)by clocking out the video chunks over UDP at a steady rate. 

For exampleyif.the video consumption rate is 2 Mbps and each UDP packet carries 8,000 bits 
of video, then the server would transmit one UDP packet into its socket every (8000 bits)/(2 
Mbps) = 4 msec. 

UDP does not employ.a congestion-control mechanism, the server can push packets into the 
network at the consumption rate of the video without the rate-control restrictions of TCP. 
Before passing the video chunks to UDP, the server will encapsulate the video chunks within 
transport packets specially designed for transporting audio and video, using the Real-Time 
Transport Protocol (RTP). 

The client and server also maintain, in parallel, a separate control connection over which the 


client sends commands regarding session state changes (such as pause, resume, reposition, 
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and so on). The Real-Time Streaming Protocol is a popular open protocol for such a control 
connection. 

Limitation: 

e Due to the unpredictable and varying amount of available bandwidth between server and 
client, constant-rate UDP streaming can fail to provide continuous playout. 

e It requires a media control server, such as an RTSP server, to process client-to-server 
interactivity requests and to track client state for each ongoing client session, 

¢ Many firewalls are configured to block UDP traffic, preventing the users, behind these 


firewalls from receiving UDP video. 


> HTTP Streaming 

¢ In HTTP streaming, the video is simply stored in an HTTP server as an ordinary file with a 
specific URL. 

e When a user wants to see the videoythe,client establishes a,TCP connection with the server 
and issues an HTTP GET request for that URL. 

¢ The server then sends the video file, within an HTTP response message, as quickly as 
possible, that is, as quicklyas TCP congestion control and flow control will allow. 

© On the client side, the bytes are collected in a’client application buffer. Once the number of 
bytes in this buffer exceeds a predetermined threshold, the client application begins 
playback—specifically, it periodically grabs video frames from the client application buffer, 


decompresses the frames, and displays them on the user’s screen. 


Advantages: 
¢ The use of HTTP over TCP also allows the video to traverse firewalls and NATs more easily. 
e Streaming over HTTP also obviates the need for a media control server, such as an RTSP 


server, reducing the cost of a large-scale deployment over the Internet. 


Limitation and solution: 
When transferring a file over TCP, the server-to client transmission rate can vary significantly 


due to TCP’s congestion control mechanism. Packets can also be significantly delayed due to 
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TCP’s retransmission mechanism. Because of these characteristics of TCP, it was believed that 
video streaming would never work well over TCP. Over time, however, designers of streaming 
video systems learned that TCP’s congestion control and reliable-data transfer mechanisms do 


not necessarily preclude continuous playout when client buffering and prefetching are used. 


Prefetching Video: 
The client can attempt to download the video at a rate higher than the consumption rate, thereby 
prefetching video frames that are to be consumed in the future. This prefetched video is naturally 


stored in the client application buffer. 


Client Application Buffer and TCP Buffers: 








VA 
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from buffer, 
TCP send TCP receive TCP application decompressed, 
buffer buffer buffer and displayed 
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Video file 


\a Web server 


e Here TCP send buffer is shown to be full, the server is momentarily prevented from sending 




















more bytes from the video file into the socket. 

© On the client side, the client application reads bytes from the TCP receive buffer and places 
the bytes into theyclient application buffer. 

e At the same time, the client application periodically grabs video frames from the client 
application buffer, decompresses the frames, and displays them on the user’s screen. 

e Consider now what happens when the user pauses the video during the streaming process. 
During the pause period, bits are not removed from the client application buffer, even though 
bits continue to enter the buffer from the server. If the client application buffer is finite, it 


may eventually become full, which will cause “back pressure” all the way back to the server. 
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Analysis of Video Streaming: 


8B 
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Fill rate =x Depletion rate =r 
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Video 
server 


Client application buffer 


¢ Let B denote the size (in bits) of the client’s application bufferyand let Q denote the number 
of bits that must be buffered before the client application begins, playout. 

¢ Let r denote the video consumption rate—the rate atywhich the client draws bits out of the 
client application buffer during playback. 

¢ Let’s assume that the server sends bits atya constantrate x whenever the client buffer is not 
full. 

e Ifx<r (that is, ifthe seryer sendrate is less than the video consumption rate), then the client 
buffer will never become full, 

e When the available rate imthe network is)more than the video rate, after the initial buffering 


delay, the user will enjoy continuous playout until the video ends. 


Early Terminationand Repositioning the Video: 

> HTTP streaming systems often make use of the HTTP byte-range header in the HTTP GET 
request\message, which specifies the specific range of bytes the client currently wants to 
retrieve from the desired video. 

> This is particularly useful when the user wants to reposition (that is, jump) to a future point in 
time in the video. 

> When the user repositions to a new position, the client sends a new HTTP request, indicating 
with the byte-range header from which byte in the file should the server send data. 

> When the server receives the new HTTP request, it can forget about any earlier request and 


instead send bytes beginning with the byte indicated in the byterange request. 
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> Adaptive Streaming and DASH 

Shortcoming of HTTP Streaming: 

All clients receive the same encoding of the video, despite the large variations in the amount of 

bandwidth available to a client, both across different clients and also over time for the same 

client. 

Solution: DASH 

e In DASH - Dynamic Adaptive Streaming over HTTP, the video is encoded into several 
different versions, with each version having a different bit rate and, correspondingly, a 
different quality level. The client dynamically requests chunks of video ségments of a few 
seconds in length from the different versions. 

e With DASH, each video version is stored in the HTTP server, each with a different URL. 

¢ The HTTP server also has a manifest file, which provides a URL, for each version along with 
its bit rate. 

¢ The client first requests the manifest file and learns about the various versions. 

¢ The client then selects one chunksat a time by specifying a URL and a byte range in an HTTP 
GET request message for each)chunk, 

e While downloading chunks, the client also. measures the received bandwidth and runs a rate 
determination algorithm.to select the chunk torequest next. 

e Naturally, if the client has a lot of video buffered and if the measured receive bandwidth is 
high, it willychoose achunk from a high-rate version. And naturally if the client has little 
video buffered and the measured received bandwidth is low, it will choose a chunk from a 
low=rate version. 

¢ By dynamically monitoring the available bandwidth and client buffer level, and adjusting the 
transmission rate)with version switching, DASH can often achieve continuous playout at the 


best possible quality level without frame freezing or skipping. 


Content Distribution Networks 


e Streaming stored video to locations all over the world while providing continuous playout 


and high interactivity is clearly a challenging task. 
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e For an Internet video company, the most straightforward approach to providing streaming 
video service is to build a single massive data center which stores all of its videos in the data 
center, and stream the videos directly from the data center to clients worldwide. 

e But this approach faces some problems: 
= Single massive date center is single point of failure 
= It leads long path to distant clients 
= It may create network congestion. 
= Popular video will likely be sent many times over the same communication links. Not 

only does this waste network bandwidth, but the Internet video company itself will be 
paying its provider ISP (connected to the data center) for sending the same bytes into the 
Internet over and over again. 
e In order to meet the challenge of distributing massive amounts of video data to users 
distributed around the world, almost all‘major video-streaming companies make use of 
Content Distribution Networks (CDNs). 
e¢ ACDN manages servers in multiple geographically distributed locations, stores copies of the 
videos (and other types of Web content, including documents, images, and audio) in its 
servers, and attempts to direct each user request to’a CDN location that will provide the best 
user experience. 
¢ The CDN may be a private CDN,that is, owned by the content provider itself; for example, 
Google’s CDN distributes YouTube videos and other types of content. 
e The CDN may alternatively be @ third-party CDN that distributes content on behalf of 
multiple content providers; for example, Akamai’s CDN is a third party CDN that distributes 
Netflix and Hulu content, among others. 
¢ CDNs typically adopt one of two different server placement philosophies: 
= Enter Deep: One philosophy, pioneered by Akamai, is to enter deep into the access 
networks of Internet Service Providers, by deploying server clusters in access ISPs all 
over the world. The goal is to get close to end users, thereby improving user-perceived 
delay and throughput by decreasing the number of links and routers between the end user 
and the CDN cluster from which it receives content. 

= Bring Home: A second design philosophy, taken by Limelight and many other CDN 


companies, is to bring the ISPs home by building large clusters at a smaller number (for 
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example, tens) of key locations and connecting these clusters using a private high-speed 
network. Instead of getting inside the access ISPs, these CDNs typically place each 
cluster at a location that is simultaneously near the PoPs of many tier-1 ISPs 

> CDN Operation 

When a browser in a user’s host is instructed to retrieve a specific video (identified by a URL), 

the CDN must intercept the request so that it can 

(1) Determine a suitable CDN server cluster for that client at that time. 


(2) Redirect the client’s request to a server in that cluster. 


www.NetCinema.com 


—— _ 
| .—_ © 
—— @- we NetCinema authoritative 
DNS server 
Local 
Na DNS server @ Se 
KingCDN authoritative 
server 





KingCDN content 
distribution server 


1. The user visits the Web page at NetCinema. 

2. When the userclicks onthe link http://video.netcinema.com/6Y7B23V, the user’s host sends 
a DNS query for video.netcinema.com. 

3. The user’s LocalDNS Server (LDNS) relays the DNS query to an authoritative DNS server 
for NetCinema, which observes the string “video” in the hostname video.netcinema.com. To 
“hand over” the DNS query to KingCDN, instead of returning an IP address, the NetCinema 
authoritative DNS server returns to the LDNS a hostname in the KingCDN’s domain, for 
example, al 105.kingcedn.com. 

4, From this point on, the DNS query enters into KingCDN’s private DNS infrastructure. The 
user’s LDNS then sends a second query, now for a1105.kingcdn.com, and KingCDN’s DNS 


system eventually returns the IP addresses of a KingCDN content server to the LDNS. It is 





Abhijith H V, Asst. Prof., Dept. of ISE, SVIT 11 


Module — 5: Multimedia Networking 





thus here, within the KingCDN’s DNS system, that the CDN server from which the client 
will receive its content is specified. 

The LDNS forwards the IP address of the content-serving CDN node to the user’s host. 

Once the client receives the IP address for a KingCDN content server, it establishes a direct 
TCP connection with the server at that IP address and issues an HTTP GET request for the 
video. If DASH is used, the server will first send to the client a manifest file with a list of 
URLs, one for each version of the video, and the client will dynamically select chunks from 


the different versions. 


> Cluster Selection Strategies 


Cluster Selection Strategies is a mechanism for dynamically directing clients to a server 
cluster or a data center within the CDN. 

The CDN learns the IP address of the client’s LDNS,server via the client’s DNS lookup. 
After learning this IP address, the CDN needs to select an appropriate cluster based on this IP 
address. 

One simple strategy is to assign the client to the cluster that is geographically closest. Using 
commercial geo-locatiom databases each LDNS IP address is mapped to a geographic 
location. When a DNS, request is received from a particular LDNS, the CDN chooses the 
geographically closest cluster. 

For some clients, thesolution may perform poorly, since the geographically closest cluster 
may not be the closest cluster along the network path. 

Inorder to determine the best cluster for a client based on the current traffic conditions, 
CDNSs caipinstead)perform periodic real-time measurements of delay and loss performance 
between their clusters and clients. 

An alternative to sending extraneous traffic for measuring path properties is to use the 
characteristics of recent and ongoing traffic between the clients and CDN servers. 

Such solutions, however, require redirecting clients to (possibly) suboptimal clusters from 
time to time in order to measure the properties of paths to these clusters. 

A very different approach to matching clients with CDN servers is to use IP anycast. The 
idea behind IP anycast is to have the routers in the Internet route the client’s packets to the 


“closest” cluster, as determined by BGP. 
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e During the IP-anycast configuration stage, the CDN company assigns the same IP address to 
each of its clusters, and uses standard BGP to advertise this IP address from each of the 
different cluster locations. 

e When a BGP router receives multiple route advertisements for this same IP address, it treats 
these advertisements as providing different paths to the same physical location. 

e Following standard operating procedures, the BGP router will then pick the “best” route to 
the IP address according to its local route selection mechanism. 

e After this initial configuration phase, the CDN can do its main job)of distributing content. 
When any client wants to see any video, the CDN’s DNS returns the anyeastyaddress, no 
matter where the client is located. 

© When the client sends a packet to that IP address, the packet isrouted)to the “closest” cluster 


as determined by the preconfigured forwarding tables, which were configured with BGP. 






Receive BGP 
advertisements for 
212.21.21.21 from 
AS1 and from AS4. 
Forward towards 
Server B since it is 
closer. 






Advertise 
212.21.21.21 







Advertise 
212.21.21.21 


DN Server A 


DN Server B 


> Case Studies: Netflix, YouTube, and Kankan 

Netflix 

© Netflix is the leading service provider for online movies and TV shows in the United States. 
e In order to rapidly deploy its large-scale service, Netflix has made extensive use of third- 


party cloud services and CDNs. Indeed, Netflix is an interesting example of a company 
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deploying a large-scale online service by renting servers, bandwidth, storage, and database 
services from third parties while using hardly any infrastructure of its own. 


e Basic Architecture: 


Amazon Cloud Upload 
versions 


ul \i \ a — . 
b l i server 
Manifest 
Registration file 
and payment 
Netflix 


registration and CDN server CDN server 
Video 
chunks 


payment servers 
(DASH) 






Client 


© Netflix has four major components: the,registration and payment servers, the Amazon cloud, 
multiple CDN providers, and clients) 
¢ In its own hardware infrastructure, Netflix maintains registration and payment servers, which 
handle registrationjof new accounts and capture credit-card payment information. 
© Netflix runs its online service byemploying machines (or virtual machines) in the Amazon 
cloud. Somejof the#functions, taking place in the Amazon cloud include: 
= Content ingestion: Before Netflix can distribute a movie to its customers, it must first 
ingest and, process the movie. Netflix receives studio master versions of movies and 
uploads them to’hosts in the Amazon cloud. 
= Content processing: The machines in the Amazon cloud create many different formats 
for each movie, suitable for a diverse array of client video players running on desktop 
computers, smartphones, and game consoles connected to televisions. A different version 
is created for each of these formats and at multiple bit rates, allowing for adaptive 


streaming over HTTP using DASH. 
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= Uploading versions to the CDNs: Once all of the versions of a movie have been created, 
the hosts in the Amazon cloud upload the versions to the CDNs. 

The Web pages for browsing the Netflix video library are served from servers in the Amazon 

cloud. 

When the user selects a movie to “Play Now,” the user’s client obtains a manifest file, also 

from servers in the Amazon cloud. The manifest file includes a variety of information, 

including a ranked list of CDNs and the URLs for the different versions of the movie, which 

are used for DASH playback. 

The ranking of the CDNs is determined by Netflix, and may change from one streaming 

session to the next. 

Typically the client will select the CDN that is ranked highest in the manifest file. 

After the client selects a CDN, the CDN leverages DNS to redirect the client to a specific 

CDN server. 

The client and that CDN server then interact using DASH. 


Youtube: 


With approximately halfia billion videos in its library and half a billion video views per day, 
YouTube is indisputably the world’s largest video-sharing site. 

YouTube beganiits service in April.2005 and was acquired by Google in November 2006. 
Google doesnot employ third-party CDNs but instead uses its own private CDN to distribute 
YouTube videos. 

Google has installed server clusters in many hundreds of different locations. From a subset of 
about)50 ofthese locations, Google distributes YouTube video. 

Google uses DNS)to redirect a customer request to a specific cluster. 

Most of the time, 

Google’s cluster selection strategy directs the client to the cluster for which the RTT between 
client and cluster is the lowest; however, in order to balance the load across clusters, 
sometimes the client is directed (via DNS) to a more distant cluster. 

If a cluster does not have the requested video, instead of fetching it from somewhere else and 
relaying it to the client, the cluster may return an HTTP redirect message, thereby redirecting 


the client to another cluster. 
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YouTube employs HTTP streaming. YouTube often makes a small number of different 
versions available for a video, each with a different bit rate and corresponding quality level. 

YouTube processes each video it receives, converting it to a YouTube video format and 
creating multiple versions at different bit rates. This processing takes place entirely within 


Google data centers. 


Kankan 


e Kankan allows the service provider to significantly reduce its infrastructure and 
bandwidth costs. 

e This approach uses P2P delivery instead of client-server (via CDNs) delivery. P2P video 
delivery is used with great success by several companies in, China, including Kankan 
(owned and operated by Xunlei), PPTV (formerly PPLive),and PPs (formerly PPstream). 

e Kankan, currently the leading P2P-based video-on-demand)provider in China, has over 
20 million unique users viewing its videos every month. 

e Ata high level, P2P video streaming is very similar to BitTorrent file downloading. 

e When a peer wants to see a video, it contacts a tracker (which may be centralized or peer- 
based using a DHT) to discover other peers in the system that have a copy of that video. 

e This peer then requests chunks, of the video file in parallel from these other peers that 
have the file. 

e Different from downloading with BitTorrent, however, requests are preferentially made 
for chunks, that are‘to be played back in the near future in order to ensure continuous 
playback. 


¢ The Kankan design employs a tracker and its own DHT for tracking content. 


Network Support for Multimedia 





There exist three broad approaches towards providing network-level support for multimedia 


applications. 
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Approach Gronulority  Guorantee © Mechonisms Complexity Deployment to date 
‘Making the all traffic none, or opplication- minimal everywhere 
best of best —treated soft layer support, 
effort service. equally CONs, overlays, 
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fesource 
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service asses of or soft policing, 
traffic scheduling 
treated 
differently 
Per-connection each soft or hard, packet marking, light little 


Quality-of SOUrce- once flow policing, scheduling; 
Service (QoS) destination is admitted call admission and 
Guarantees flows treated signaling 

differently 





e Making the best of best-effort service: The application-level mechanisms and infrastructure 
can be successfully used in a well-dimensioned network where packet loss and excessive 
end-to-end delay rarely occur, When demand, increases are forecasted, the ISPs deploy 
additional bandwidth and,switching capacity to continue to ensure satisfactory delay and 
packet-loss performance. 

¢ Differentiated service: With differentiated service, one type of traffic might be given strict 
priority overanother elass of traffic when both types of traffic are queued at a router. 

e Per-connection Quality-of-Service (QoS) Guarantees: With per-connection QoS 
guarantees, each instance of an application explicitly reserves end-to-end bandwidth and thus 
has ‘guaranteed end-to-end performance. A hard guarantee means the application will 
receive its requested quality of service (QoS) with certainty. A soft guarantee means the 


application will receive its requested quality of service with high probability. 


> Dimensioning Best-Effort Networks 


e A first approach to improving the quality of multimedia applications is through providing 


enough link capacity throughout the network so that network congestion, and its consequent 
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packet delay and loss, never (or only very rarely) occurs. With enough link capacity, packets 
could zip through today’s Internet without queuing delay or loss. 
e Challenges: 

= The question of how much capacity to provide at network links in a given topology to 
achieve a given level of performance is often known as bandwidth provisioning. 

= The even more complicated problem of how to design a network, topology (where to 
place routers, how to interconnect routers with links, and what capacity to assign to links) 
to achieve a given level of end-to-end performance is a network design, problem often 


referred to as network dimensioning. 


> Providing Multiple Classes of Service 

The simplest enhancement to the one-size-fits-all best-effort service in today’s Internet is to 
divide traffic into classes, and provide different levels of service to these different classes of 
traffic. 

The type-of-service (ToS) field in the IPv4 header can bewused for this purpose. 


Motivating Scenarios 





~~ & 





Here H1 and H3 are using audio application, H2 and H4 are using HTTP web application. 
e In the best-effort Internet, the audio and HTTP packets are mixed in the output queue at R1 


and (typically) transmitted in a first-in-first-out (FIFO) order. 





Abhijith H V, Asst. Prof., Dept. of ISE, SVIT 18 


Module — 5: Multimedia Networking 





e In this scenario, a burst of packets from the Web server could potentially fill up the queue, 
causing IP audio packets to be excessively delayed or lost due to buffer overflow at R1. 

e Solution for this is differentiating traffic class and assigning suitable priority to it. 

e Packet marking allows a router to distinguish among packets belonging to different classes 
of traffic. 

e Now suppose that the router is configured to give priority to packets marked as belonging to 
the 1 Mbps audio application. Since the outgoing link speed is 1.5 Mbps, even though the 
HTTP packets receive lower priority, they can still, on averagejreceive.0.5 Mbps of 
transmission service. But if the audio application starts sending packets at arate of 1.5 Mbps 
or higher, the HTTP packets will starve, that is, they will not receive any service on the R1- 
to-R2 link. 

¢ Therefore it is desirable to provide a degree of traffic isolation among classes so that one 
class is not adversely affected by another class of traffie,that misbehayes. 

¢ Ifa traffic class or flow must meet certain criteriay then a policing mechanism can be put into 
place to ensure that these criteria,are indeed observed, If the policed application misbehaves, 
the policing mechanism will take some action so that the traffic actually entering the network 
conforms to the criteria. 

¢ Acomplementary approach for providing isolation among traffic classes is for the link-level 
packet-scheduling mechanism to explicitly allocate a fixed amount of link bandwidth to each 
class. 

¢ While providing isolation among classes or flows, it is desirable to use resources (for 


example, link bandwidth and buffers) as efficiently as possible. 


> Scheduling Mechanisms 
First-In-First-Out (FIFO) 
The FIFO (also known as first-come-first-served, or FCFS) scheduling discipline selects packets 


for link transmission in the same order in which they arrived at the output link queue. 
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Priority Queuing 

Under priority queuing, packets arriving at the output link are classified into priority classes at 
the output queue. 

Each priority class typically has its own queue. When choosing a’packet to,transmit, the priority 
queuing discipline will transmit a packet from the highest priority class that has a nonempty 


queue. The choice among packets in the same priority class is typically done in a FIFO manner. 
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Round Robin 

Under the round robin queuing discipline, packets are sorted into classes as with priority 
queuing. 

However, rather than there being a strict priority of service among classes, a round robin 
scheduler alternates service among the classes. 

In the simplest form of round robin scheduling, a class 1 packet is transmitted, followed by a 
class 2 packet, followed by a class | packet, followed by a class 2 packet, and so on. 

A work-conserving round robin discipline that looks for a packet of a given class but finds none 


will immediately check the next class in the round robin sequence. 
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Weighted Fair Queuing (WFQ) 

A generalized abstraction of.round robin queuing that has found considerable use in QoS 
architectures is weighted fair queuing (WFQ) discipline. 

Here arriving packets are classified and queued in the appropriate per-class waiting area. As in 
round robin scheduling, a WFQ scheduler will serve classes in a circular manner—first serving 
class 1, then serving class 2),then serving class 3, and then (assuming there are three classes) 
repeating the service pattern. 

WFQ is also’awork-conserying queuing discipline and thus will immediately move on to the 


next class imthe/service sequence when it finds an empty class queue. 
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> Policing: The Leaky Bucket 


Three important policing criteria: 


Average rate: The network may wish to limit the long-term average rate (packets per time 
interval) at which a flow’s packets can be sent into the network. A crucial issue here is the 
interval of time over which the average rate will be policed. 

Peak rate: While the average-rate constraint limits the amount of traffie)that can be sent into 
the network over a relatively long period of time, a peak-rate constraint limits the maximum 
number of packets that can be sent over a shorter period of time. 

Burst size: The network may also wish to limit the maximumnumber of packets (the “burst” 


of packets) that can be sent into the network over an extremely short interval of time. 


The leaky bucket mechanism is an abstraction that can be used to, characterize these policing 


limits. 


r tokeneeec| 


Bucket holds 
up to 
b tokens 


Packets & ‘ies 


wait area 


To network 








A leaky bucket consists of a bucket that can hold up to b tokens. 

Tokens are/added to,this bucket as follows. New tokens, which may potentially be added to 
the bucket, are always being generated at a rate of r tokens per second. 

If the bucket is filled with less than b tokens when a token is generated, thenewly generated 
token is added to the bucket; otherwise the newly generated token is ignored, and the token 
bucket remains full with b tokens. 

Suppose that before a packet is transmitted into the network, it must first remove a token 


from the token bucket. If the token bucket is empty, the packet must wait for a token. 
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e Because there can be at most b tokens in the bucket, the maximum burst size for a leaky- 
bucket policed flow is b packets. Furthermore, because the token generation rate is r, the 
maximum number of packets that can enter the network of any interval of time of length t is 
tt+b. 


e Leaky Bucket + Weighted Fair Queuing = Provable Maximum Delay in a Queue 
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Diffserv provides service differentiation—that is, the ability to handle different classes of traffic 
in different ways within the Internet.in a sealable manner. 

The need for scalability arises from the fact that millions of simultaneous source-destination 
traffic flows may be presentat a backbone router. 

The Diffserv architecture consists of two sets of functional elements: 

1) Edge functions: packet classification and traffic conditioning. At the incoming edge of the 
network arriving packets are marked. The mark that a packet receives identifies the class of 
traffic to which it belongs. Different classes of traffic will then receive different service within 
the core network. 

2) Core function: forwarding. When a DS-marked packet arrives at a Diffserv capable router, 
the packet is forwarded onto its next hop according to the so-called per-hop behavior (PHB) 
associated with that packet’s class. The per-hop behavior influences how a router’s buffers and 


link bandwidth are shared among the competing classes of traffic. 
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Key: 


GD leat router GB Core router 


Packets arriving to the edge router are first classified. The classifier selects packets based on 


the values of one or more packet,header fields (for example, source address, destination 
address, source port, destinatiom,port, and protocol ID) and steers the packet to the 
appropriate marking function. 

In some cases, an end userumay have agreed to limit its packet-sending rate to conform to a 
declared traffic profile. The traffic profile might contain a limit on the peak rate, as well as 
the burstiness of the packet flow. 

As long as the user sends packets, into the network in a way that conforms to the negotiated 
traffic profile, the packets receive their priority marking and are forwarded along their route 
to theidestination, 

On the, otherhand, if the traffic profile is violated, out-of-profile packets might be marked 
differently; might be shaped (for example, delayed so that a maximum rate constraint would 
be observed), or might be dropped at the network edge. 

The role of the metering function, is to compare the incoming packet flow with the 
negotiated traffic profile and to determine whether a packet is within the negotiated traffic 
profile. 

The second key component of the Diffserv architecture involves the per-hop behavior (PHB) 


performed by Diffserv-capable routers. PHB is rather cryptically, but carefully, defined as “a 
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description of the externally observable forwarding behavior of a Diffserv node applied to a 
particular Diffserv behavior aggregate”. 


e A PHB can result in different classes of traffic receiving different performance. 








Ld 7 Shaper! Y — 
Classifier Marker > 
Packets Dropper | Forward 
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° The expedited forwarding PHB specifies that the departure rate ofa class of traffic from a 


























router must equal or exceed a configured rate. 
e The assured forwarding PHB,divides traffic into)four classes, where each AF class is 


guaranteed to be provided with. some minimum)amount of bandwidth and buffering. 


Per-Connection Quality-of-Service (QoS) Guarantees: Resource 





Reservation and Call Admission 

Consider two 1 Mbps audio. applications transmitting their packets over the 1.5 Mbps link. The 
combined data rate of thestwo flows (2 Mbps) exceeds the link capacity. 

There is simply notenough bandwidth to accommodate the needs of both applications at the 
same time) If the)two applications equally share the bandwidth, each application would lose 25 


percent of its transmitted packets. 






15 Mbps link 
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If sufficient resources will not always be available, and QoS is to be guaranteed, a call admission 
process is needed in which flows declare their QoS requirements and are then either admitted to 
the network (at the required QoS) or blocked from the network (if the required QoS cannot be 
provided by the network). 

The process of having a flow declare its QoS requirement, and then having the network either 
accept the flow (at the required QoS) or block the flow is referred to as the call admission 
process. 

Resource reservation: The only way to guarantee that a call will have the resources (link 
bandwidth, buffers) needed to meet its desired QoS is to explicitly allocate those resources to the 
call—a process known in networking parlance as resource reservation. Once resources are 
reserved, the call has on-demand access to these resources throughout its, duration, regardless of 
the demands of all other calls. If a call reserves and receives a guaranteejof x Mbps of link 
bandwidth, and never transmits at a rate greater than x)the call wilhsee loss- and delay-free 
performance. 

Call admission: If resources are to bé reserved, then the network must have a mechanism for 
calls to request and reserve resources) Since resources are not infinite, a call making a call 
admission request will be denied admission)that is, beblocked, if the requested resources are not 
available. Such a call admission.is performed by the telephone network—we request resources 
when we dial a number. If the circuits (TDMA slots) needed to complete the call are available, 
the circuits are allocated andthe call is completed. If the circuits are not available, then the call is 
blocked, and’ we receive a busy signal. A blocked call can try again to gain admission to the 
network, but it isnot allowed to)send traffic into the network until it has successfully completed 
the call admission process. Of course, a router that allocates link bandwidth should not allocate 
more than ois available at that link. Typically, a call may reserve only a fraction of the link’s 
bandwidth, and so a router may allocate link bandwidth to more than one call. However, the sum 
of the allocated bandwidth to all calls should be less than the link capacity if hard quality of 
service guarantees are to be provided. 

Call setup signaling: The call admission process described above requires that a call be able to 
reserve sufficient resources at each and every network router on its source-to-destination path to 
ensure that its end-to-end QoS requirement is met. Each router must determine the local 


resources required by the session, consider the amounts of its resources that are already 
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committed to other ongoing sessions, and determine whether it has sufficient resources to satisfy 
the per-hop QoS requirement of the session at this router without violating local QoS guarantees 
made to an already-admitted session. A signaling protocol is needed to coordinate these various 
activities—the per-hop allocation of local resources, as well as the overall end-to-end decision of 
whether or not the call has been able to reserve sufficient resources at each and every router on 
the end-to-end path. The RSVP protocol was proposed for this purpose within an Internet 
architecture for providing qualityof- service guarantees. 


QoS call signaling setup 





Request/reply 
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Module — 4 


Wireless and Mobile Network 


Cellular Internet Access 


Cellular telephony is now ubiquitous in many areas throughout the world as it support not 
only voice telephony but wireless Internet access as well. 

Ideally, this Internet access would be at a reasonably high speed and would provide for 
seamless mobility, allowing users to maintain their TCP sessions while traveling, for 


example, on a bus or a train. 


An Overview of Cellular Network Architecture 


Nearly 80% of total Cellular networkuusers adopt Global System for Mobile communication 
(GSM) Standard. 

First generation (1G) systemsjwere analog FDMA systems designed exclusively for voice- 
only communication. These)1G systems)are almost extinct now, having been replaced by 
digital 2G systems. 

The original 2G,systems wereyalso designed for voice, but later extended (2.5G) to support 
data (i.e., Internet) as, well as,yoice)service. 

The 3G systems that currently are being deployed also support voice and data, but with an 


ever increasing emphasis on data capabilities and higher-speed radio access links. 


2G Cellular Network 


The term cellular refers toythe fact that the region covered by a cellular network is partitioned 


into a number of geographic coverage areas, known as cells. 


Each cell contains a base transceiver station (BTS) that transmits signals to and receives signals 


from the mobile stations in its cell. 


The coverage area of a cell depends on many factors, including the transmitting power of the 


BTS, the transmitting power of the user devices, obstructing buildings in the cell, and the height 


of base station antennas. 
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The GSM standard for 2G cellular ‘systems uses combined FDM/TDM (radio) for the air 
interface. In combined FDM/TDM systems, the)channel is partitioned into a number of 
frequency sub-bands; within each sub+band, time is partitioned into frames and slots. 

GSM systems consist of 200-kHzfrequency bands with each band supporting eight TDM 
calls. GSM encodes speech at 13 kbps and 12.2,kbps. 

A GSM network’s base station, controller (BSC) will typically service several tens of base 
transceiver stations.,The role of the BSC is to allocate BTS radio channels to mobile 
subscribers, perform paging (finding the,cell in.which a mobile user is resident), and perform 
handoff of mobile users; 

The base’ station controller and its,controlled base transceiver stations collectively constitute 
a GSM base station system (BSS). 

The mobile) switching, center (MSC) plays the central role in user authorization and 
accounting (e.g:, determining whether a mobile device is allowed to connect to the cellular 
network), call establishment and teardown, and handoff. 

A single MSC will typically contain up to five BSCs, resulting in approximately 200K 
subscribers per MSC. 

A cellular provider’s network will have a number of MSCs, with special MSCs known as 
gateway MSCs connecting the provider’s cellular network to the larger public telephone 


network. 
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3G Cellular Data Networks 
e = The 3G core cellular data network connects radio access networks to the public Internet. 
e Principle adapted in 3G: leave the existing core GSM cellular voice network untouched, 


adding additional cellular data functionality in parallel to the existing cellular voice network. 


Public telephone 
network 
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Internet 
Radio Network 
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|< Radio Interface >| 
(WCDMA, HSPA) 





|< Radio Access Network >l< Core Network >| Public >| 
Universal Terrestrial Radio General Packet Radio Service Le 
‘Access Network (UTRAN) (GPRS) Core Network 
Key: 


Serving GPRS Gateway GPRS 
Support Node Support Node 
(SGSN) (GGSN) 


e There are two types of nodes in the 3G core network: Serving GPRS Support Nodes 
(SGSNs) and Gateway GPRS Support Nodes (GGSNs). (GPRS stands for Generalized 
Packet Radio Service) 

e An SGSN is responsible for delivering datagrams to/from the mobile nodes in the radio 
access network to which the SGSN is attached. 

e The SGSN interacts with the cellular voice network’s MSC for that area, providing user 
authorization and handoff, maintaining location (cell) information about active mobile nodes, 
and performing datagram forwarding between mobile nodes in the radio access network and 
aGGSN. 


e The GGSN acts as a gateway, connecting multiple SGSNs into the larger Internet. 
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3G Radio Access Network: The Wireless Edge 


The Radio Network Controller (RNC) typically controls several cell base transceiver stations. 
The RNC connects to both the circuit-switched cellular voice network via an MSC, and to the 
packet-switched Internet via an SGSN. 

It uses a CDMA technique known as Direct Sequence Wideband CDMA (DS-WCDMA) 
within TDMA slots. 

The data service associated with the WCDMA specification is known as HSP (High Speed 


Packet Access) and promises downlink data rates of up to 14 Mbps. 


4G Networks: Long Term Evaluation (LTE) 


The 4G Long-Term Evolution (LTE) standards has two important innovations over 3G systems: 


1. 


Evolved Packet Core (EPC) 

The EPC is a simplified all-IP core network that.unifies the separate circuit-switched cellular 
voice network and the packet-switched cellular data network. 

It is an “all-IP” network in,that both voice and data will be carried in IP datagrams. 

A key task of the EPC is)to manage network resources, to provide this high quality of service. 
The EPC also makes a clear.separation between,the network control and user data planes, 
with many of the mobility,support features. 

The EP€ allows multiple types,of radio access networks, including legacy 2G and 3G radio 


access networks, to attach to the core network. 


LTE Radio Access Network, 

LTE uses a combination of frequency division multiplexing and time division multiplexing 
on the downstream channel, known as orthogonal frequency division multiplexing (OFDM). 
In LTE, each active mobile node is allocated one or more 0.5 ms time slots in one or more of 
the channel frequencies. By being allocated increasingly more time slots a mobile node is 
able to achieve increasingly higher transmission rates. 

The maximum data rate for an LTE user is 100 Mbps in the downstream direction and 50 


Mbps in the upstream direction, when using 20 MHz worth of wireless spectrum. 
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Mobility nagement: Principle: 


A mobile node is one that changes its point of attachment into the network over time. 


Several dimensions of mobility: 


1. 


From the network layer’s standpoint, how mobile is a user? 
A physically mobile user will present a very different set of challenges to the network layer, 
depending on how he or she moves between points of attachment to the network. Below 


figure shows various level of mobility. 


No mobility High mobility 
| | | | 
User moves only User moves between User moves between 
within same wireless access networks, access networks, 
access network shutting down while while maintaining 
moving between ongoing connections 
networks 


How important is it for the mobile node’s address to always remain the same? 

With mobile telephony the network-layer address,of phone remains the same as you travel 
from one provider’s mobile phone network to)another. 

If a mobile entity is able to maintain, its IPaddress as it moves, mobility becomes invisible 
from the application standpoint. There is great, value to this transparency—an application 
need not be concerned with a potentially changing IP address, and the same application code 
serves mobiléand nonmobile, connections alike, 

A less frequent mobile user might simply want to turn off an office laptop, bring that laptop 
home, power up, and work from home. If the laptop functions primarily as a client in client- 
server applications (e.g., send/read e-mail, browse the Web, Telnet to a remote host) from 
home, the particular IP address used by the laptop is not that important. 

What supporting wired infrastructure is available? 

It is assumed that there is a fixed infrastructure to which the mobile user can connect—for 
example, the home’s ISP network, the wireless access network in the office, or the wireless 
access networks lining the autobahn. What if no such infrastructure exists? If two users are 
within communication proximity of each other, can they establish a network connection in 


the absence of any other network-layer infrastructure? 
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Elements of Mobile Network Architecture 


Home network: Visited network: 
128.119.40/24 79.129.13/24 


Mobile node 








Permanent address: 
128.119.40.186 Permanent address: 
128.119.40.186 


‘Care-of address: 
79,129.13. 


Home agent ——~ Foreign agent 


wide area 
network 


Correspondent 


> Ina network setting, the permanent home of a mobile node,is knowmas the home network, 
and the entity within the home network that performs the mobility management functions of 
the mobile node is known as the home agent. 

> The network in which the mobile,node is currently residing is known as the foreign (or 
visited) network, and the entitywithin,the foreign network that helps the mobile node with 
the mobility management functions discussed below is known as a foreign agent. 


> Acorrespondent is the entity,wishing to communicate with the mobile node. 


Addressing 

In order for user mobility to be, transparent to network applications, it is desirable for a mobile 

node to keep\its address as it moves from one network to another. 

When a mobile node is,resident in a foreign network, all traffic addressed to the node’s 

permanent address now needs to be routed to the foreign network. 

One option is for the foreign network to advertise to all other networks that the mobile 

node is resident in its network. 

e The foreign network could simply advertise to its neighbors that it has a highly specific route 
to the mobile node’s permanent address. 

e These neighbors would then propagate this routing information throughout the network as 


part of the normal procedure of updating routing information and forwarding tables. 
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When the mobile node leaves one foreign network and joins another, the new foreign 
network would advertise a new, highly specific route to the mobile node, and the old foreign 
network would withdraw its routing information regarding the mobile node. 


Limitation: Scalability 


Another Approach is through care-of-address 


The conceptually simplest approach is to locate foreign agents at the edge routers in the 
foreign network. One role of the foreign agent is to create care-of address (COA) for the 
mobile node, with the network portion of the COA matching that of the foreign network. 
There are thus two addresses associated with a mobile node, its permanent address and its 
COA, sometimes known as a foreign address. 

A second role of the foreign agent is to inform the home agent that the mobile node is 


resident in its network and has the given COA. 


Routing to a Mobile Node 
1) Indirect Routing to a Mobile Node 










Home network: Visited network: 
128.119.40/24 79.129.13/24 
Mobile node 
Permanent address: ——) 
128.119.40.186 Permanent address: | 


128.119.40.186 & 
Care-of 
address: 
79.129.13.2 







Home Foreign 
agent Wide area agent 
network 





Correspondent 
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e In the indirect routing approach, the correspondent simply addresses the datagram to the 
mobile node’s permanent address and sends the datagram into the network, blissfully 
unaware of whether the mobile node is resident in its home network or is visiting a foreign 
network; mobility is thus completely transparent to the correspondent. 

¢ Home agent is responsible for interacting with a foreign agent to track the mobile node’s 
COA. 

e¢ Home agent second job is to be on the lookout for arriving datagrams addressed to nodes 
whose home network is that of the home agent but that aré)currently resident in a foreign 
network. The home agent intercepts these datagrams, and then forwards them to a mobile 
node in a two-step process. 

¢ The datagram is first forwarded to the foreign agent; using theymobile node’s COA, and then 
forwarded from the foreign agent to the mobile node. 

e Home agent encapsulate the correspondent’s original complete datagram within a new 
(larger) datagram. This larger datagram is,addressed and delivered to the mobile node’s 
COA. 

¢ The foreign agent, who“owns” the GOA, will receive and decapsulate the datagram—that is, 
remove the correspondent’s original datagram from within the larger encapsulating datagram 
and forward the original datagram to the mobile node. 

e Since the mobile node knows, the correspondent’s address, there is no need to route the 


datagram back through the home agent. 


dest: 79.129.13.2 dest! 128.119-40.186 |) 










Permanent address: 
128.119.40.186 Permanent address: 
128.119.40.186 


Care-of address: 
79.129.13.2—~ 


Foreign 
agent agent 


dest: 128.119.40.186 
dest: 128.119.40.186 


Correspondent 
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Indirect routing requires: 

e A mobile-node-to—foreign-agent protocol: The mobile node will register with the foreign 
agent when attaching to the foreign network. Similarly, a mobile node will deregister with 
the foreign agent when it leaves the foreign network. 

e A foreign-agent—to—-home-agent registration protocol: The foreign agent will register the 
mobile node’s COA with the home agent. A foreign agent need not explicitly deregister a 
COA when a mobile node leaves its network, because the subsequent registration of a new 
COA, when the mobile node moves to a new network, will take care of this. 

e A home-agent datagram encapsulation protocol: Encapsulation and forwarding of the 
correspondent’s original datagram within a datagram addressed to the COA. 

e A foreign-agent decapsulation protocol: Extraction of the correspondent’s original 
datagram from the encapsulating datagram, and the forwarding of the original datagram to 


the mobile node. 


2) Direct Routing to a Mobile Node 

Limitation of indirect routing: 

The indirect routing approach suffers from an inefficiency known as the triangle routing 
problem—datagrams, addressed to)the mobile node must be routed first to the home agent and 
then to the foreign network, even when a)much more efficient route exists between the 
correspondent and themobilenode. 

In the worst case, imagine amobile user who is visiting the foreign network of a colleague. 

The two are Sitting side by side and)exchanging data over the network. Datagrams from the 
correspondent, (in this case)the colleague of the visitor) are routed to the mobile user’s home 


agent and then back again\to the foreign network. 


Solution: 

e Direct routing overcomes the inefficiency of triangle routing, but does so at the cost of 
additional complexity. 

e In the direct routing approach, a correspondent agent in the correspondent’s network first 


learns the COA of the mobile node. This can be done by having the correspondent agent 
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query the home agent, assuming that the mobile node has an up-to-date value for its COA 
registered with its home agent. 

e It is also possible for the correspondent itself to perform the function of the correspondent 
agent, just as a mobile node could perform the function of the foreign agent. (step 1 and step 
2 of below figure). 

© The correspondent agent then tunnels datagrams directly to the mobile node’s COA, in a 


manner analogous to the tunneling performed by the home agent, (steps 3 and 4 ) 


Home network: Visited network: 
128.119.40/24 79.129,13/24 
Mobile node 









Permanent address: 
128.119.40.186 Permanent address: 
128.119.40.186 
SED care-of address: 
Home Foreign 79-129-13.2 
agent @ Wide area agent 
network 
@ 
Correspondent 
agent 
Correspondent 


Key: 
Control messages 
—> Data flow 


While direct routing overcomes the triangle routing problem, it introduces two important 

additional challenges: 

e A mobile-user location protocol is needed for the correspondent agent to query the home 
agent to obtain the mobile node’s COA. 

e Suppose data is currently being forwarded to the mobile node in the foreign network where 
the mobile node was located when the session first started (Step 1).We’ll identify the foreign 


agent in that foreign network where the mobile node was first found as the anchor foreign 
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agent. When the mobile node moves to a new foreign network (step 2), the mobile node 
registers with the new foreign agent (step 3), and the new foreign agent provides the anchor 
foreign agent with the mobile node’s new COA (step 4). When the anchor foreign agent 
receives an encapsulated datagram for a departed mobile node, it can then re-encapsulate the 
datagram and forward it to the mobile node (step 5) using the new COA. If the mobile node 
later moves yet again to a new foreign network, the foreign agent in that new visited network 


would then contact the anchor foreign agent in order to set up forwarding to this new foreign 






network. 
Home network: 
Foreign network 
being visited at 
session start: 
Anchor 
foreign 
a Wide area agent 
agent network 
New foreign 
network: 
SS Correspondent 
agent 
Correspondent New foreign agent 
Mobile IP 


Mobile IP is a)flexible)standard, supporting many different modes of operation (for example, 

operation with or without a foreign agent), multiple ways for agents and mobile nodes to 

discover each other, use of single or multiple COAs, and multiple forms of encapsulation. 

The mobile IP standard consists of three main pieces: 

e Agent discovery: Mobile IP defines the protocols used by a home or foreign agent to 
advertise its services to mobile nodes, and protocols for mobile nodes to solicit the services 


of a foreign or home agent. 
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Registration with the home agent: Mobile IP defines the protocols used by the mobile node 
and/or foreign agent to register and deregister COAs with a mobile node’s home agent. 

Indirect routing of datagrams: The standard also defines the manner in which datagrams 
are forwarded to mobile nodes by a home agent, including rules for forwarding datagrams, 


rules for handling error conditions, and several forms of encapsulation 


Agent Discovery 


A mobile IP node arriving to a new network, whether attaching to a foreign network or 
returning to its home network, must learn the identityof the corresponding foreign or home 
agent. Indeed it is the discovery of a new foreign agent, with a new network address, that 
allows the network layer in a mobile node to learn, that it has moved into a new foreign 
network. This process is known as agent discovery. 

Agent discovery can be accomplished in one.of two ways: via agent advertisement or via 
agent solicitation. 

With agent solicitation, a mobile node wanting to learn about agents without waiting to 
receive an agent advertisement can broadcast an agent solicitation message, which is simply 
an ICMP message with type value.10. An agent receiving the solicitation will unicast an 
agent advertisement directly to the mobile node. 

With agent advertisement, a foreign or home agent advertises its services using an ICMP 


type=9 and code=0)message. 





° 8 16 24 _ 
Type=9 — Code=0 Checksum 
L standard 
Router address ICMP fields 
Type=16 Length Sequence number 
Registration lifetime RBHINGIT Reserved 
[Mobility agent 
0 or more care-of addresses advartirenant 
extension 
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Important fields in agent advertisement are: 


Home agent bit (H): Indicates that the agent is a home agent for the network in which it 
resides. 

Foreign agent bit (F): Indicates that the agent is a foreign agent for the network in which it 
resides. 

Registration required bit (R): Indicates that a mobile user in this network must register 
with a foreign agent. 

M, G encapsulation bits: Indicate whether a form of encapsulation willbe used. 

Care-of address (COA) fields: A list of one or more care-of addresses, provided by the 


foreign agent. 


Registration with the Home Agent 


Once a mobile IP node has received a COA)that address must beregisteredwith the home agent. 


This involves following four steps. 


Step 1: 


Following the receipt of,a foreign agent adyertisement, a mobile node sends a mobile IP 
registration message to the foreign agent. 

The registration message,is carried within a UDP datagram and sent to port 434. 

The registration message carries a COA advertised by the foreign agent, the address of the 
home agent (HA), the permanent address)of the mobile node (MA), the requested lifetime of 
the registration, and a 64-bit registration identification. 

The requested registration lifetime is the number of seconds that the registration is to be 
valid. If the registration is not renewed at the home agent within the specified lifetime, the 


registration will become invalid. 


Step 2: 


The foreign agent receives the registration message and records the mobile node’s permanent 
IP address. 


The foreign agent then sends a mobile IP registration message to home agent. 


Step 3: 


The home agent receives the registration request and checks for authenticity and correctness. 
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The home agent binds the mobile node’s permanent IP address with the COA; in the future, 


datagrams arriving at the home agent and addressed to the mobile node will now be 


encapsulated and tunneled to the COA. 


registration lifetime, and the registration 
with this reply. 
Step 4: 


The home agent sends a mobile IP registration reply containing the HA, MA, actual 


identification of the request that is being satisfied 


The foreign agent receives the registration reply and then forwards it to the mobile node. 


Home agent 
HA: 128.119.40.7 


SS 


COA: 79. 


Registration req. 


COA: 79.129.13.2 
HA‘128.119.40.7 
MA: 128.119.40.186 
Lifetime: 9999 





| Registration reply 











HA: 128.119.40.7 
MA: 128.119.40.186 
Lifetime: 4999 
identification: 714 
encapsulation format 








Time 


Foreign 
1.129.13.2 


Ss 


Time 


Visited network: 
79.129.13/24 
Ll 
[ 1 
Mobile agent 
MA: 128.119.40.186 





agent 


ICMP agent adv. 
COA: 79.129.13.2 










Registration req. 


COA: 79.129.13.2 
HA‘128.119.40.7 
MA: 128.119.40.186 
identification: 714 








Registration reply 


HA: 128.119.40.7 
MA: 128.119.40.186 








Time 
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Indirect Routing: 
Correspondent sends data to home agent. Home agent encapsulates the data and forwards it to 


foreign agent using COA. Foreign agent delivers the data to mobile node. 


Managing Mobility in Cellular Networks 





Like mobile IP, GSM adopts an indirect routing approach, first routing the correspondent’s call 

to the mobile user’s home network and from there to the visited network. 

In GSM terminology, the mobile users’s home network is referred)to as the mobile user’s home 

public land mobile network (home PLMN) or home network. 

The home network is the cellular provider with whichthe mobile user has a subscription. 

The visited PLMN, which will be refered as the visited,network}is the network in which the 

mobile user is currently residing. 

Home network has home MSC (home agent)and MSC in visited network is visited MSC 

(visited agent). 

Each MSC maintain two databases: 

1. Home location register, (HLR): HLR contains the permanent cell phone number and 
subscriber profile information for each ofits subscribers. Importantly, the HLR also contains 
information about the current locations of these subscribers. 

2. Visitor docation register (VLR): The VLR contains an entry for each mobile user that is 
currently in the portion of the network served by the VLR. VLR entries thus come and go as 


mobileusers enter and leave thenetwork. 


Routing Calls to aMobile User 

1. The correspondent dials the mobile user’s phone number. The leading digits in the number 
are sufficient to globally identify the mobile’s home network. The call is routed from the 
correspondent through the PSTN to the home MSC in the mobile’s home network. 

2. The home MSC receives the call and interrogates the HLR to determine the location of the 
mobile user. In the simplest case, the HLR returns the mobile station roaming number 


(MSRN), or roaming number. 





Abhijith H V, Asst. Prof., Dept. of ISE, SVIT 15 


Module — 4: Wireless and Mobile Network 





3. The call is completed, being routed from the correspondent to the home MSC, and from there 


to the visited MSC, and from there to the base station serving the mobile user. 






Correspondent 


Public switched 
telephone 
network 


ob elas 


i] & Visited 


network 





Handoffs in GSM 
e A handoff occurs when)a mobile station changes its association from one base station to 
another during a call. 
¢ There may be several reasons forhandoff to o¢cur, including 
1. Thessignal between the current base station and the mobile may have deteriorated to such 
an extentthat the call isin danger of being dropped 


2. “Avcell may have become overloaded, handling a large number of calls. 








Abhijith H V, Asst. Prof., Dept. of ISE, SVIT 16 


Module — 4: Wireless and Mobile Network 





Steps: 


1. 


The old base station (BS) informs the visited MSC that a handoff is to be performed and the 
BS (or possible set of BSs) to which the mobile is to be handed off. 

The visited MSC initiates path setup to the new BS, allocating the resources needed to carry 
the rerouted call, and signaling the new BS that a handoff is about to occur. 

The new BS allocates and activates a radio channel for use by the mobile. 

The new BS signals back to the visited MSC and the old BS that the\visited-MSC-to-new-BS 
path has been established and that the mobile should be informed of the impending handoff. 
The new BS provides all of the information that the mobile willeed toyassociate with the 
new BS. 

The mobile is informed that it should perform a handoff. Note that up until this point, the 
mobile has been blissfully unaware that the network has;been laying the groundwork (e.g., 
allocating a channel in the new BS and allocating a path from the visited MSC to the new 
BS) for a handoff. 

The mobile and the new BS exchange jone or more)messages to fully activate the new 
channel in the new BS. 

The mobile sends a handoff complete message,to the new BS, which is forwarded up to the 
visited MSC. The visited MSC then reroutes the ongoing call to the mobile via the new BS. 


The resources allocated along,the path to the old)BS are then released. 


Inter-MSC handoff 


The anchor MSC isthe MSC visited by the mobile when a call first begins; the anchor MSC 
thus remains unchanged:during the call. 

Throughout \the call’s. duration and regardless of the number of inter-MSC transfers 
performed by the mobile, the call is routed from the home MSC to the anchor MSC, and then 
from the anchor MSC to the visited MSC where the mobile is currently located. 

When a mobile moves from the coverage area of one MSC to another, the ongoing call is 
rerouted from the anchor MSC to the new visited MSC containing the new base station. 
Thus, at all times there are at most three MSCs (the home MSC, the anchor MSC, and the 


visited MSC) between the correspondent and the mobile. 
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Rather than maintaining a single MSC hop from the anchor MSC to the current MSC, an 
alternative approach would have been to simply chain the MSCs visited by the mobile, 
having an old MSC forward the ongoing call to the new MSC each time the mobile moves to 


anew MSC. 


Home network Correspondent Home network Correspondent 






Anchor 
Msc 





a. Before handoff b. After handoff 


Wireless.and Mobility: Impact onHigher- Layer Protocols 





Network layer provides the)same best-effort delivery service model to upper layers in both 
wired and wireless networks. 

Similarly, if protocols, such as TCP or UDP are used to provide transport-layer services to 
applications, in,both wired and wireless networks, then the application layer should remain 
unchanged as well. 

The TCP and UDP can operate in networks with wireless links. On the other hand, transport 
protocols in general, and TCP in particular, can sometimes have very different performance 
in wired and wireless networks. 

TCP retransmits a segment that is either lost or corrupted on the path between sender and 
receiver. In the case of mobile users, loss can result from either network congestion or from 


handoff. In all cases, TCP’s receiver-to-sender ACK indicates only that a segment was not 
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received intact; the sender is unaware of whether the segment was lost due to congestion, 

during handoff, or due to detected bit errors. In all cases, the sender’s response is the same— 

to retransmit the segment. 

¢ Bit errors are much more common in wireless networks than in wired networks. 

e Given high bit error rates on wireless links and the possibility of handoff loss, TCP’s 
congestion-control response could be problematic in a wireless setting. Three broad classes 
of approaches are possible for dealing with this problem: 

1. Local recovery: Local recovery protocols recover from)bit errors when and where they 
occur. 

2. TCP sender awareness of wireless links: In the local recovery approaches, the TCP 
sender is blissfully unaware that its segments aretraversing,a wireless link. An alternative 
approach is for the TCP sender and receiver to be aware of the existence of a wireless 
link, to distinguish between congestive losses occurring in the, wired network and 
corruption/loss occurring at the wireless link, and to invoke, congestion control only in 
response to congestive wired-network losses. 

3. Split-connection approaches: In a split-connection approach the end-to-end connection 
between the mobile user and, the other end point is broken into two transport-layer 
connections:)one from the mobile, host to the wireless access point, and one from the 
wireless access point to the other. communication end point. The end-to-end connection is 
thus formed by.the concatenation ofa wireless part and a wired part. 

¢ Wireless links often have relatively low bandwidth. As a result, applications that operate over 
wireless links, particularly over cellular wireless links, must treat bandwidth as a scarce 


commodity. 
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Network Security 


Overview of Network Security 


e Network security is required by the users to communicate on the network. 
e If medium is insecure then an intruder may intercept, read and modify the transmitted-data 


from sender to receiver. 


Elements of Network Security 
1) Confidentiality: Information should be available only to those who have rightful access to it 
2) Authenticity and integrity: The sender of a message and the message itself should be 


verified at the receiving-point 





Tam User 1 


Key 






User I User 2 User 


f= Tam User 1 B 


| | 
\ | 
| | 
| (a 
' y 


Incruder Tncruder 
(a) ) 
(a) Message content and sender identity falsified by intruder; (b) a method of applied security 
e In figure a, user | sends a message ("i am user 1") to user 2. Since the network lacks any 
security system, an intruder can receive the message and change its content to a different 
message ("hi i am user 1") and send it to user 2. User 2 may not know that this falsified 
message is really from user | (authentication). 
e In figure 10.1b, a security block is added to each side of the communication, and a secret key 
that only users 1 and 2 would know about is included. Therefore, the message is changed to a 


form that cannot be altered by the intruder. 


Threats To Network Security 
Internet infrastructure attacks are broadly classified into 4 categories 


1) DNS hacking 
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2) Routing table poisoning 
3) Packet mistreatment 


4) Denial of Service (DOS) 


DNS HACKING ATTACKS 
+ DNS server is a distributed hierarchical and global directory that translates domain names 
into numerical IP address. 
+ DNS is a critical infrastructure, and all hosts contact DNS to access servers and start 
connections. 
+ Name-resolution services in the modern Internet environment are essential for email 
transmission, navigation to web sites, or data transfer. Thus, an attack on DNS can 
potentially affect a large portion of the Internet. 
+ A DNS hacking attack can appear in any of the following forms 
1) Masquerading Attack: The attacker poses as a trusted entity and obtains all the secret 
information. The attacker can stop any message from being transmitted further or can 
change the content or redirect the packet to bogus servers. This action is also known as a 
middle-man attack. 

2) Domain Highjacking Attack: Whenever a user enters a domain address, he is forced to 
enter into the attacker's Web site. 

3) Information Leakage Attack: The attacker sends a query to all hosts identifies which IP 
addresses are not used and uses those IP address to make other types of attacks 

4) Information-Level Attack( Cache Poisoning): This forces a server to correspond with 
other than the correct answer. The hacker tricks a remote name-servers into caching the 
answer for a third-party domain by providing malicious information and redirects traffic 


to a preselected site. 


ROUTING TABLE POISONING 
* This is the undesired modification of routing tables. This results in a lower throughput of the 
network. 


+ Two types of attacks are: i) link attack and ii)router attack. 
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Link Attack 

+ Link attack occurs when a hacker gets access to a link and thereby intercepts, interrupts or 
modifies routing messages. This act similarly on both the link-state and the distance-vector 
protocols. 

+ If an attacker succeeds in placing an attack in a link-state routing protocol, a router may send 
incorrect updates about its neighbors or remain silent even if the link state of its neighbor has 


changed 


Router Attack 

+ Router Attack may affect the link-state protocol or even the distance-vector protocol. 

+ In link-state protocol, if routers are attacked, they become malicious. As a result, routers may 
add a non existing link to a routing table delete an existing link or change the cost of a link. 

+ In the distance-vector protocol, an attacker may cause routers to send wrong updates about 


any node in the network, thereby misleading a router and resulting in network problems. 


PACKET MISTREATMENT ATTACKS 


+ Packet mistreatment attacks can occur during any data transmission. 

+ A hacker may capture certain data packets and mistreat them. 

* The attack may result in congestion lowering throughput & DOS attacks 

+ Link-attack causes interruption, modification or replication of data packets. Whereas, a 
router-attack can misroute all packets and may result in congestion or DOS 

Following are some examples: 

1) Interruption: If an attacker intercepts packets, they may not be allowed to be propagated to 
their destinations. 

2) Modification: Attackers may succeed in accessing the content of a packet. They can then 
change the address of the packet or change the data of the packet. This kind of attack can be 
detected by digital signature mechanism. 

3) Replication: An attacker may trap a packet and duplicate it. This kind of attack can be 
detected by using the sequence number for each packet. 

4) Malicious Misrouting of Packets: A hacker may attack a router and change its routing table, 


resulting in misrouting of data packets. 
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5) Ping of death: An attacker may send a ping message, which is large and therefore must be 
fragmented for transport. The receiver then starts to reassemble the fragments as the ping 


fragments arrive. The total packet length becomes too large and might cause a system crash. 


DOS ATTACKS (DENIAL OF SERVICE) 
+ DOS isa type of security breach that prohibits a user from accessing normally provided 
services. 
+ DOS can cost the target person a large amount of time and money. 
+ DOSaffects the destination rather than a data-packet or router. 
+ They take important servers out of action for few hours, thereby denying service to all users. 
Two types of attacks are: 
1) Single-source: An attacker sends a large number of packets to a target system to 
overwhelm & disable it 
2) Distributed: A large number of hosts are used to flood unwanted traffic to a single target. 


The target cannot then be accessible to other users in the network. 


Overview of Security Methods 


Common solutions that can protect computer communication networks from attacks are 


classified are cryptographic techniques or authentication techniques(verification). 


Cryptographic Techniques 

+ Cryptography is the process of transforming a piece of information or message shared by two 
parties into some sort of code. 

+ The message is scrambled before transmission so that it is undetectable by outside watchers. 

+ The scrambled-message needs to be decoded at the receiving-end before any further 
processing. 

+ The main tool used to encrypt a message M is a secret-key K. 

+ The fundamental operation used to encrypt a message is the exclusive-OR( © ). 

+ Assume that we have one-bit M and a secret-bit K. A simple encryption is carried out using 


M&K. 
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+ To decrypt this message, the second party can detect M by performing the following 
operation: Me K) S K=M 

* In end-to-end encryption, secret coding is carried out at both end systems. In link encryption, 
all the traffic passing over that link is secured. 

* Two types of encryption techniques are secret-key & public-key encryption 
1) In secret-key model, both sender & receiver conventionally use same key for an 

encryption process. 

2) In public-key model, a sender and a receiver each use a different key. 

e The public-key system is more powerful than the secret key system & provides better 
security and message privacy. 


Drawbacks of public-key system: slow speed and more complex computationally 





Link Encryption 
‘ 


End-to-End Encryption \. 
“ \ 





fees enn Backbone Network - ----—-— > 
' ' 


Authentication Techniques 

Encryption methods offer the assurance of message confidentiality. A networking-system must 
be able to verify the authenticity of the message and the sender of the message. These forms of 
security techniques are known as authentication techniques. 

Authentication techniques are categorized as 

i) authentication with message digest 


ii) authentication with digital signature. 
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Secret Key Encryption Protocols 

* This is also called as symmetric encryption or single-key encryption. 

+ Sender and receiver conventionally use the same key for an encryption process. 

+ This consist of an encryption-algorithm, a key and a decryption-algorithm 

+ The encrypted-message is called cipher text. 

* Two popular protocols are: 
1) DES (Data Encryption Standard) 
2) AES (Advanced Encryption Standard) 

+ Ashared secret-key between a transmitter and a receiver is assigned at the transmitter and 
receiver points. 

+ At the receiving end, the encrypted information can be transformed back to the original data 


by using decryption algorithm and secret key. 


DES (Data Encryption Standard) 

+ Plaintext messages are converted into 64-bit blocks & each block is encrypted using a key. 
+ The key length is 56 bits. 

+ DES consists of 16 identical rounds of an operation. 


Initial 
Permutation Round 1 Round 16 
L 





Plaintext 





Permutation 


fiom 


Begin DES Algorithm 
1) Initialize. Before round | begins, all 64 bits of the message and all 56 bits of the secret key 


are separately permuted (shuffled). 
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2) Each incoming 64-bit message is broken into two 32-bit halves denoted by Li and Ri 
respectively. 

3) The 56 bits of the key are also broken into two 28-halves, and each half is rotated one or two 
bit positions, depending on the round. 

4) All 56 bits of the key are permuted, producing version ki of the key on roundi. 

5) Liand Ri are determined by 
L=Rin 
and 
Ri = Lia © F(Ria,kKi) 

6) All 64 bits of a message are permuted. 

Operation of function F() 

+ Out of 56 bits of key ki, function F( ) chooses 48 bits. 

+ The 32-bit R i-1 is expanded from 32 bits to 48 bits so that it can be combined with 48-bit k i. 
The expansion of R j - 1 is carried out by first breaking R j - 1 into eight 4-bit chunks and then 
expanding each chunk by copying the leftmost bit and the rightmost bit from left and right 
adjacent chunks , respectively. 

+ F() also partitions the 48 bits of ki into eight 6-bit chunks. 

+ The corresponding eight chunks of Rj-; and eight chunks of kiare combined as follows 


Ris = Ri-l @ki 


AES (Advanced Encryption Standard) 

+ AES has better security strength than DES. 

+ In AES message is divided into 128-bit block, and it uses 128 or 192 or 256 bit key. 

+ Based on the key size number of rounds can be 10,12 or 14. 

+ The plaintext is formed as 16 bytes m0 through m15 and is fed into round | after an 
initialization stage. 

+ In this round, substitute-units(S) perform a byte-by-byte substitution of blocks. 

* The ciphers move through a permutation-stage to shift rows to mix-columns. 

+ At the end of this round, all 16 blocks of ciphers are Exclusive-ORed with the 16 bytes of 
round | key k0(1) through k15(1). 
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Initialization Round 1 Round 10 
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Public Key Encryption Protocols 


* This is also called as asymmetric or two key encryption. 
+ A sender/receiver pair use different keys. 
* This is based on mathematical functions rather than on substitution or permutation. 
* Two popular protocols are: 
i) RSA protocol 
ii) Diffie-Hillman key-exchange protocol. 
+ Either of the two related keys can be used for encryption; the other one for decryption. 
+ Each system publishes its encryption key by placing it in a public-register & sorts out key as 
public one. The companion key is kept private. 
+ If A wishes to send a message to B, A encrypts the message by using B's public key. 
* Atreceiving end, B decrypts the message by using its private key. 
+ No other recipients can decrypt the message, since only B knows its private key. 
+ The public-key system is more powerful than the secret key system & provides better 
+ Drawbacks of public-key system: 
co slow speed 


© more complex computationally 


RSA ALGORITHM 
+ Assume that a plaintext m must be encrypted to a cipher text c. 


+ This has three phases: key generation, encryption and decryption. 
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Key Generation Algorithm 
1) Choose two prime numbers a and b and compute n=a.b 
2) Find x. Select encryption-key x such that x and (a-1)(b-1) are relatively prime. 
3) Find y. Calculate decryption-key y. 
xy mod (a-1)(b-1) = 1 
4) At this point, a and b can be discarded. 
5) The public key = {x, n} 
6) The private key = {y, n} 
Encryption 
1) Both sender and receiver must know the value of n. 
2) The sender knows the value of x and only the receiver knows the value of y. 
3) Ciphertext c is constructed by 
c=m* mod n 
Decryption 
Given the ciphertext c, the plaintext m is extracted by 


m=c’ mod n. 


DIFFIE-HILLMAN KEY-EXCHANGE PROTOCOL 
+ Two end users can agree on a shared secret-code without any information shared in advance. 
+ This protocol is normally used for VPN (virtual private network). 
+ Assume that user-1 wishes to communicate with user-2. 
Key Generation Algorithm 
1) User-1 selects a prime number 'a’, random integer number 'x1', and a generator 'g'. Then 
creates 'y1' such that 
y! =g*! moda 
2) User-2 performs the same function and creates y2 such that 
y=g? 


3) User-1 then sends y1 to user-2. Now, user-1 forms its key k1 using the information its partner 





mod a 


sent as 
ki =y2*! moda 


4) User-2 forms its key ka using the information its partner send it as 
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ko =y1* moda 
5) The two keys k1 and k2 are equal. The two users can now encrypt their messages, each using 


its own key 


AUTHENTICATION 

+ Message-authentication verifies the authenticity of both the message-sender and themessage- 
content. 

+ Message-sender is authenticated through implementation of a digital signature. 

+ Message-content is authenticated through implementation of a hash function and encryption 
of the resulting message-digest. 

+ Hash-function is used to produce a "fingerprint" of a message. 

+ The hash-value is added at the end of message before transmission. 

+ The receiver re-computes the hash-value from the received message and compares it to the 
received hash value. 

+ If the two hash-values are the same, the message was not altered during transmission. 

* Once a hash-function is applied on a message m, the result is known as a message-digest 
h(m). 

+ The hash-function has the following properties 
1) Unlike the encryption-algorithm, the authentication algorithm is not required to be 
reversible. 
2) Given a message-digest h(m),it is computationally infeasible to find m. 
3) This is computationally infeasible to find two different messages m1 and m2 such that 
h(@m1)=h(m2). 


+ Message-authentication can be implemented by two methods. 


1) In first method, a hash-function is applied on a message and then a process of encryption is 
implemented. At the receiver site, the received message-digest is decrypted and the comparison 
is made between the decrypted h(m) and the message-digest made locally from the received 
message. compare it with the one made locally at its site for any judgments on the integrity of the 


message. 
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2) In second method, no encryption is involved. The two parties share a secret key. Hence, at the 
receiving site, the comparison is made between the received h(m) and the message-digest made 


locally from the received message. 





User 1 





Secure Hash Algorithm (SHA) 

+ The Secure Hash Algorithm (SHA) was proposed as part of the digital signature standard. 
SHA-1, the first version of this standard, takes messages with a maximum length of 2 4 and 
produces a 160-bit digest. 

+ With this algorithm, SHA-1 uses five registers, R ; through R 5 , to maintain a "state" of 20 
bytes. 

+ The first step is to pad a message m with length | m . The message length is forced to | m = 448 
mod 512. In other words, the length of the padded message becomes 64 bits less than the 
multiple of 512 bits. 

+ After padding, the second step is to expand each block of 512-bit (16 32 bits) words {mo, m1, 
«+, M5 } to words of 80 32 bits using: 

w; =m; for0 <i < 15 

And 

W; = W;_3 B W;_g O W;_14 ® W;_16 — 1 for 16 <i <79, 
where jHheans left rotation by j bits. 

+ Then, the 80 steps (i = 0, 1, 2, ..., 79) of the four rounds are described as follows 
5 =(R, 5) + F)(Ro, R3, Rg) + R5 + wj) +C; 
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R; = Ry 
Ry=R; 
R;= R, <= 30 
Ry =R, 
R,=6, 


Where C j is a constant value specified by the standard for round i . 


(aN b)U (ane) 0<i<19 
Kiab a= a@be@c 20< <39— 
saint (aNb)U(aNe)U(bNe) 40<i1 <59 
a@bec 60 <i <79 


The message digest is produced by concatenation of the values in R through R s. 


Authentication and Digital Signature 

+ A digital signature on a message is required for the authentication and identification of the 
right sender. 

+ RSA algorithm can be used to implement digital signature. 

+ The message is encrypted with the sender's private key. Thus, the entire encrypted message 
serves as a digital signature. 

+ At the receiving end, the receiver can decrypt the message using the public key. This 


authenticates that the packet comes from the right user. 


Firewalls 


+ Firewall is placed between hosts of a certain network and the outside world. 

+ Firewall is used to protect the network from unwanted web sites and potential hackers. 

+ The main objective is to monitor and filter packets coming from unknown sources. 

* Firewall can also be used to control data traffic. 

+ Firewall can be a software program or a hardware device. 
1) Software firewalls can be installed in home computers by using an Internet connection 
with gateways. 


+ 2) Hardware firewalls are more secure than software firewalls are not expensive. 
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A firewall controls the flow of traffic by one of the following three methods: 
1) Packet filtering: A firewall filters those packets that pass through. If packets can get 
through the filter, they reach their destinations: otherwise, they are discarded 
2) A firewall filters packets based on the source IP address. This filtering is helpful when a 
host has to be protected from any unwanted external packets. 


3) Denial of Service (DOS). This method controls the number of packets entering a 


network. 





Router 
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